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New sources of sex cells 


Within the next decade or so, it will become possible to derive sperm and eggs from skin cells. The ethical 
and technical hurdles need to be addressed with the scientific and therapeutic benefits in mind. 


a jarring reminder of how conventions in the way humans are 

created could shift. Just a week later, scientists, bioethicists, law- 
yers and journal editors convened in Hinxton, UK, to ponder how 
long it will be until sperm and eggs can be made entirely in a Petri dish 
from, say, skin cells induced to pluripotency. They asked questions 
such as how will these advances transform reproductive research 
and medicine? How might they change society? Is the bioethics 
community prepared? 

These questions have acquired a sense of urgency, at least in Britain, 
where a government bill updating the 1990 Human Fertilisation and 
Embryology Act is working its way through the system. If passed by 
the House of Commons in its present form, the bill would allow basic 
research on human gametes derived in vitro but would not permit 
the use of such gametes for fertility treatment. Opponents may yet 
propose amendments designed to stop all embryo research — includ- 
ing the demonstration that in-vitro-derived gametes are normal by 
seeing whether they are capable of fertilization and further develop- 
ment. The Hinxton group’s consensus statement, arrived at after three 
days of discussion, can be viewed at www.hinxtongroup.org/Consen- 
sus_HG08_FINAL.pdf. 

The organizers scheduled the topic of pluripotent stem cell derived 
gametes (PSCDGs) two years ago. Since then, making induced 
pluripotent stem (iPS) cells has become relatively straightforward, but 
although the technique circumvents the ethical and technical prob- 
lems associated with collecting human eggs and creating embryos, 
there are still serious ethical implications. Indeed, the facility with 
which iPS cells can be derived could make it easier to derive gametes 
from any person, living or dead. 

The potential benefits of PSCDGs are impressive. They could reveal 
much about the mechanism of gamete development, with implica- 
tions for treating infertility, certain cancers and genetic diseases, and 
for facilitating drug tests on gametes. Other potential applications 
present social and ethical challenges: germline genetic modifica- 
tion for the correction of disease mutations or for research; various 


F arlier this month, the world had its first look at a pregnant man, 


forms of biological enhancement; genetic screening of embryos 
for selection; and enabling same-sex couples to have their own 
genetic offspring. 

The Hinxton group believes that eggs and sperm will be generated 
from pluripotent stem cells in 5-15 years, although it concedes this 
is something ofa guess. Technically, a major sticking point is getting 
the developing gametes to undergo meiosis. Ethically, the problem 
is quality control: the only conclusive way to test that gametes are 
functional and can create a viable embryo is to go ahead and do so. 

Once gametes can be made, the immediate question will be whether 
and how embryos derived from them can be created and studied. So 
iPS research will come full circle, and we will be faced again with the 
issue that has dogged us since the birth of Louise Brown, the first IVF 
baby in 1978. Is it moral to use human embryos for research or to 
create human life with the implicit intention of destroying it during 
an experiment? 

Not conducting this research could cause harm: without it, a preg- 
nancy using an embryo produced by PSCDGs would pose an unac- 
ceptable risk for both child and mother. If PSCDGs are to one day join 
the tools of assisted reproduction, functional tests must be developed to 
eliminate faulty gametes. Quality tests developed for IVF will be useful 
in this context and the methods should first be exhaustively assessed 
on animal models, including non-human primates. 

Clearly, appropriate oversight of PSCDG research is needed, but 
much of the framework for doing so is already in place, in the form 
of committees for overseeing stem-cell research and institutional 
review boards for human experimental subjects. Nonetheless, 14 
countries already prohibit or restrict PSCDG technology for genetic 
screening or germline genetic modification, some under penalty 
of imprisonment. Japan prohibits the creation of human PSCDGs 
altogether. Where stricter regulations are needed in many countries 
is on experimental fertility treatments involving human embryos 
carried out in clinics. But for any country considering legislation in 
this area, beware of impeding basic research and losing the benefits 
that could result. a 


The big ome 


It's time to make the case for proteins. 


sometimes stuck with rather mundane labels. Certainly they 

sit in the scientific shadow of the genome sequence and the 
eulogies that inspires. So protein biologists face an uphill battle if they 
are to fire up the research community — and the world beyond — 
enough to buy into a Human Proteome Project. 


/ Ts body’s building blocks; ‘cellular machinery’: proteins are 


The project is just getting past the back-of-the-envelope stage but, 
in essence, it would systematically catalogue all the proteins manu- 
factured in the body: what they are, where they are and in what abun- 
dance (see page 920). A cancer biologist might reveal whether a rogue 
protein is overexpressed in the tumours she studies compared with 
levels from healthy tissue that are logged in the proteome register. A 
geneticist who traces Alzheimer’s susceptibility to a region of code 
could consult the proteome to reveal which proteins are being manu- 
factured from that region in the brain. We can expect this catalogue of 
proteins to eventually include the targets for almost all future drugs. 

There are many obvious parallels with efforts to elucidate the 
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human genome. A human proteome would be a very expensive and 
ambitious undertaking and, by its nature, the full benefits cannot be 
spelt out beforehand. That's how it should be. The fun of the human 
genome is that spelling out the letters did little to decipher the code — 
there is so much more complexity than had ever been imagined. We 
can expect that the proteome will also reveal unexpected delights 
about the ways in which proteins rally together to perform a task or 
become tissue. 

And yet there was a certain intellectual allure about ‘cracking’ the 
human genome that, on the face of it, is lacking from cataloguing 
all the proteins. And although the human genome is finite, the pro- 
teome is almost boundless, because each of the body’s proteins may be 
present in different forms and different amounts in each tissue — and 
even in precisely the same cell from one moment to the next. That 
very complexity, however, should inspire, not dissuade. 

Proteomic analyses have also been viewed with some scepticism, 
in part because many studies involving mass-spectrometry profiling 
have proved difficult to reproduce. And the field has so far largely 
failed to deliver the disease-tracking biomarkers on which these 
early efforts were sold. There are few examples of clear, clinically 
proven benefits. Starting out with this kind of reputation will make 


a Human Proteome Project particularly hard to sell. 

The Human Proteome Organisation (HUPO) has taken some 
praiseworthy steps towards resolving these issues with, for exam- 
ple, a project to show that different labs can now produce identical 
results from the same sample. With the rapid evolution of proteomic 
techniques, the field’s reputation and utility is likely to pick up. But 
HUPO — and the proteomics community — still has a lot to prove 
and a successful Human Proteome Project is its chance to prove it. 
It needs to consult widely to devise a strategy that has strong com- 
munity backing. It will also need to demonstrate that the knowledge 
stemming from this project will transform the research landscape in 
the same way that the genome sequence has done. 

Failing these, either the project will die or HUPO could risk being 
left on the sidelines as organizations with money to spend make the 
decisions about how proteomics should be done. 

Times have changed since the Human Genome Project — propos- 
als for mega-biology projects are rather more common and money 
scarcer. HUPO, and all biologists who love proteins, should articulate 
clearly and loudly the benefits of sinking US$1 billion into protein 
biology. It will take much work to get due recognition for the cell's 
work-horses: 2 


Superconductors redux 


Yet another surprise has been uncovered in the 
complex oxides. 


ith the discovery of a new class of high-temperature super- 

VV conductors by reseachers in Japan (see page 922), history 

seems to be repeating itself. In 1986, Georg Bednorz and 

Alex Miiller of IBM’s Zurich research laboratories discovered that 

a complex oxide of barium, lanthanum and copper became super- 

conducting at 35 K. This sparked an orgy of research that led to the 

discovery ofa related compound (yttrium barium copper oxide) with 

a superconducting transition temperature of 90 K: high enough to 

be attained with relatively cheap liquid-nitrogen cooling. It also won 
Bednorz and Miiller a Nobel prize just a year later. 

The excitement stemmed from the prospect of exploiting low-cost 
superconductivity for loss-free electrical transmission, magnetic levi- 
tation and other dazzling applications. In superconductors, currents 
flow essentially without electrical resistance, the source of energy loss 
through heating. Before 1986, most superconductors were metals and 
alloys, with generally paltry transition temperatures that no one had 
managed to push above 23 K. 

Now Hideo Hosono of the Tokyo Institute of Technology and his 
colleagues have shown that another complex oxide, containing lan- 
thanum, iron, arsenic and a little fluorine, will superconduct at 43 K 
when squeezed by around 40,000 atmospheres pressure: a higher 
temperature than anything bar the copper oxides (see H. Taka- 
hashi et al. advance online publication: doi:10.1038/nature06972). 
Like them, the new material has a sandwich structure of alternate 
conducting and insulating layers. And like them, doping (replac- 
ing some oxygen with fluorine) injects electrons into the conducting 
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layer that contribute to the supercurrent. 

The copper-oxide materials have found some uses, but nothing to 
match the expectations heaped on them in the late 1980s — levitat- 
ing trains and so forth. It has proved hard to fashion these brittle 
materials into wires and progress is slow. So cynics might grumble 
that the new breakthrough will merely renew the same unfulfilled 
promises. 

But the new compound already offers more. For one thing, it reveals 
how much remains to be discovered about complex solid-state com- 
pounds. The combinatorial possibilities for four or more elements 
are so vast that we have barely scratched the surface, despite efforts 
to automate the search. And as before, the discovery followed from 
sound chemical intuition. Bednorz and Miller were led to the copper 
oxides from the apparently unpromising strontium titanium oxide, 
a superconductor at a mere 0.3 K, by reasoning what kind of crystal 
chemistry might boost the requisite interactions between electrons. 
Hosono and his colleagues similarly picked a systematic path from 
their initial discovery, in 2006, of superconductivity at about 4K ina 
related material, a temperature so low that it attracted little interest. 
They raised this to 26 K by the start of 2008, and rightly figured that 
squeezing would take it further. 

Most crucially, the ‘iron oxypnictides’ show that high-tempera- 
ture conductivity is not the sole preserve of copper oxides. As in 
that case, superconductivity in the new materials seems to be related 
to magnetic behaviour. But quite how this works has remained a 
mystery. With an entirely new family of compounds to play with, 
the mechanism might be persuaded to start giving up some secrets. 
With a theory to hand, ‘designer superconductors’ with much higher 
transition temperatures might not look like a fool’s quest. There could 
be another Nobel prize in that. For now, it is enough that the oxy- 
pnictides have set the community buzzing in a way that recalls the 
last heyday of superconductors two decades ago. 2 
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Mercuric swallows 
Science 320, 335 (2008) 


High levels of mercury in rivers present a broader 
threat to ecosystems than previously thought, 
according to Daniel Cristol and his colleagues at the 
College of William and Mary in Williamsburg, Virginia. 
They found that mercury that entered the headwaters 
of the Shenandoah River in Virginia between 1930 and 


1950 has accumulated not only in aquatic animals 
and the creatures that eat them, but also in birds that 
exclusively gobble terrestrial insects and spiders. 
Blood samples from 12 out of 13 species of terrestrial- 
feeding birds breeding near the river contained notably 
higher levels of mercury than samples the team 
collected from the same species at uncontaminated 
sites. But whether aquatic insects venturing onto land 


or historical floods that deposited the pollutant are 


responsible for its spread remains unclear. 


Unblinking dots 


J. Am. Chem. Soc. 130, 5026-5027 (2008) 
Quantum dots are semiconductor 
nanocrystals that emit light, making them 
suitable for such things as medical imaging, 
but their usefulness is undermined bya 
tendency to blink or even go out when 
their surface chemistry is disrupted. 
Encasing quantum dots in a thick alloy 
shell renders their optical behaviour much 
more robust, write Jennifer Hollingsworth 
and her colleagues at Los Alamos National 
Laboratory in New Mexico. 

The team slowly grew many thin layers 
of cadmium sulphide, zinc sulphide and 
cadmium zinc sulphide on cadmium selenide 
quantum dots that were 3-4 nanometres 
in size until the particles reached 15-20 
nanometres across. This altered the colour 
of the emitted light and increased the 
fraction of time for which the dots glowed, 
an improvement that should facilitate new 
applications in biology and optoelectronics, 
the researchers say. 


Feeling the air 


Cell 133, 223-234 (2008) 

The process by which mice adapt to low 
oxygen levels is mediated at least in part by 
their skin, report Randall Johnson at the 
University of California, San Diego, and his 
colleagues. 

When they knocked out the ability of mouse 
skin cells — but no other cells — to make a 
protein associated with adaptation to hypoxia, 
the researchers found that an adaptation 
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mediated by the kidneys was greatly reduced. 
It seems that, normally, when oxygen levels are 
low, the protein alters blood flow from the skin 
to the kidneys and encourages the production 
of erythropoietin (EPO). This tells the body 

to build red blood cells — a useful response to 
low oxygen availability. 

The work demonstrates a new drug-based 
mechanism for stimulating EPO production, 
as well as an intriguing role for skin as an 
independent environmental oxygen sensor, 
the authors say. 


Stencilissimo 


Nano Lett. doi:10.1021/nl080033t (2008) 
Engineers at the University of Cambridge, UK, 
and the Nokia Research Centre in the same 
city have stenciled two- and three-dimensional 
patterns onto the surface of a silicon wafer 
using silicon oxide nanowires as a mask. 
Andrea Ferrari and his co-workers first 
attached the nanowires to the surface ofa 
silicon wafer with nickel leads. They then 
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etched the wafer chemically, which left 
extremely thin lines of silicon under the 
nanowires. 

Were the nanowires to be arranged in 
preset patterns — a task that might be 
achieved with microfluidic techniques — the 
etching process would carve nanoscale- 
circuits into crystalline silicon. 


Darwinian chips 


PLoS Biol. 6, 85 (2008) 

Researchers in California have demonstrated 
computer-controlled natural selection of an 
RNA molecule in real time. 

Brian Paegel and Gerald Joyce of the Scripps 
Research Institute in La Jolla took an RNA 
molecule that can chemically stitch itself to 
other RNA fragments and reproduce itself as a 
result. They then loaded this catalyst and some 
suitable RNA fragments into a chip that can 
hold minute volumes of liquid, and connected 
this toa computer. The computer diluted the 
growing population and resupplied the RNA 
fragments at ever lower concentrations. 

Over time, the system created a selection 
pressure for more efficient RNA catalysts. 
After 500 dilutions, the RNA catalyst had 
accumulated 11 mutations, and performed 90 
times faster than the original catalyst. 


A misunderstood microbe 


Science 320, 382-385 (2008) 

The flagellar filaments of Campylobacter 
jejuni, a major cause of bacterial diarrhoea 
worldwide, are composed of seven 
protofilaments. The same anatomical 
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parts contain 11 protofilaments in all other 
bacteria studied so far. 

Edward Egelman of the University of 
Virginia in Charlottesville and his colleagues 
uncovered this break from tradition when 
they examined the species’ flagella filaments 
using electron cryo-microscopy. They 
propose that C. jejuni’s unusual filament 
structure arose from a minor change in a 
region of the protein that is responsible for 
the helical symmetry of flagella filaments. 


MOLECULAR BIOLOGY 


Arsenic's action 


Nature Cell Biol. doi:10.1038/ncb1717 (2008); Nature 
Cell Biol. doi: 10.1038/ncb1716 (2008) 
Arsenic trioxide, a poison familiar to readers 
of Agatha Christie, has been used to treat acute 
promyelocytic leukaemia since 1992, although 
how it works has never been fully explained. 
Two studies have now pin-pointed 
RNF4, a member ofa family of 
enzymes called ubiquitin ligases, as 
the reason why the drug destroys 
PML-RARa, a protein produced in 
patients with this cancer. 

It was already known that arsenic 
trioxide attaches several copies 
of SUMO, a signalling molecule, 
to PML-RARa, and that this 
addition prompts the destruction 
of PML-RARa. Ronald Hay of the 
University of Dundee, UK, Hugues 
de Thé of the University of Paris 
and their respective colleagues 
have shown that the SUMO chains 
are recognized by RNF4, which 
directs PML-RARa to an organelle called the 
proteasome, where PML-RARa is chewed up. 


PALAEONTOLOGY 
Amphibious origins 


Proc. Natl Acad. Sci. USA 105, 5786-5791 (2008) 
Isotopes in the teeth of elephants’ extinct 
relatives are adding weight to the arguments 
of researchers who think that the Eocene 
proboscideans from which elephants evolved 
lived in freshwater. 

Alexander Liu at the University of Oxford, 
UK, and his colleagues measured the oxygen 
isotopes in enamel taken from the teeth of 
two late-Eocene genera, Barytherium and 
Moeritherium. These genera are closely 
related ‘sister taxa’ of modern elephants. The 
isotopic composition of their teeth matched 
those of many other aquatic and semi-aquatic 
mammals, making a case for elephants and 
sirenians — that is, manatees and dugongs 
— sharing an amphibious ancestor that lived 
about 50 million years ago. 


CRYSTALLOGRAPHY 


Probing a crystal 


Phys. Rev. Lett. 100, 145502 (2008). 
A way to distinguish molecular left- 
and right-handedness (chirality) has 
been demonstrated using the SPring-8 
synchrotron X-ray source in Harima, Japan. 
Yoshikazu Tanaka of the RIKEN SPring-8 
Center in Sayo, Japan, and his co-workers 
have found that the instrument's bright beams 
of circularly polarized X-rays create disparate 
diffraction patterns for the two chiral forms 
of quartz. Normal X-ray diffraction gives 
identical patterns for both versions. When the 
angle at which the circularly polarized beam 
hits the crystals varies between the chiral 
forms, the intensity at which it is reflected 
differs. The method can directly ascertain 
chirality when measuring the rotation of 
polarized light cannot. 


ASTRONOMY 


Galactic adolescence 


Astrophys. J. 677, 943-956 (2008) 

The Spitzer Space Teloscope (picture above) 
has detected about 2,600 ancient galaxies 
undergoing a growth spurt. These galaxies 
are about ten billion light years from Earth 
and hidden from ground-based telescopes by 
their own silicate dust, which reddened their 
light in the same way that our atmosphere 
reddens sunsets. 

Arjun Dey of the National Optical 
Astronomy Observatory in Tucson, Arizona, 
and his collaborators used the Spitzer to detect 
the infrared glow of the galaxies, finding a 
subset of galaxies that formed hundreds or 
thousands of stars per year and that had fast- 
expanding black holes at their centres. 

The team believes that this rapid growth 
lasted for 100-200 million years — a mere 
blip in galaxy evolution — and that these far- 
off galaxies went on to resemble the massive 
galaxies close to our own. 
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RESEARCH HIGHLIGHTS 


JOURNAL CLUB 


Vijay Kuchroo 
Harvard Medical School, Boston, 
Massachusetts 


An immunologist muses about 
inflammation through cell 
interactions. 


I spend my lab hours trying to 
understand what prompts T cells 
—atype of white blood cell — to 
specialize. Some T cells produce 
soluble molecules that rattle the 
immune system into an inflamed 
state; other cells generate 
molecules that calm the system 
back down. 

Upon infection, cells such as 
macrophages — another type of 
white blood cell — produce soluble 
molecules called interleukins that 
direct the fate of the responding 
T cells. An emerging curiosity 
in the field is which interleukins 
make certain T cells become 
pro-inflammatory, and which 
cause other T cells to become 
anti-inflammatory. This decision 
is crucial for determining whether 
an immune response induces or 
suppresses inflammation. 

Recently, investigators have 
turned their attention towards 
an interleukin known as IL-27. 
This is produced by activated 
macrophages and was initially 
thought to induce IFNy, a 
signalling molecule that activates 
macrophages even more. 

But work by Nico Giraldi and his 
colleagues at Genentech in South 
San Francisco, and other groups, 
has recast IL-27 as a molecule 
that primarily directs T cells to 
suppress inflammation. In a paper 
published in March, Giraldi’s 
team confirmed that IL-27 acts in 
this way because it causes CD4+ 
and CD8+ T cells to make the 
anti-inflammatory IL-10, and does 
not work through an alternative 
pathway (M. Batten et al. 

J. Immunol. 180, 2752-2756; 2008). 
Mice with Listeria infections or 
autoimmune tissue inflammation 
in their brains and spinal cords 
generated fewer IL-10-producing 

T cells when they lacked an 

IL-27 receptor. Whether an 
analogous interaction occurs in 
humans is not known, but, if it does 
occur, this research could become 
medically useful. 


Discuss this paper at http://blogs. 
nature.com/nature/journalclub 
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NEWS 


Data show extent of sexism in physics 


Women are poorly represented in physics, mak- 
ing up just 10% of faculty in the United States, 
for example, but the reasons for this have proved 
contentious. Now a particle physicist claims to 
have hard data showing institutional sexism 
at an experiment at one of America’s highest- 
profile physics labs. 

Sherry Towers claims that female postdocs 
worked significantly harder than their male 
peers but were awarded one-third as many con- 
ference presentations proportionally. “There 
was this shocking difference,’ says Towers, who 
now studies statistics at Purdue University in 
West Lafayette, Indiana. “Particle physics really 
hasn't moved forward in 30 years.” 

Towers used data from publicly available work 
records to chart the careers of 57 postdoctoral 
researchers, including nine women, who worked 
on the ‘DZero’ particle detector at Fermilab in 
Batavia, Illinois, between 1998 and 2006. Tow- 
ers herself worked as a postdoc on the project 
between 2000 and 2005. The findings of her 
survey were striking, she says. She claims that 
women did 40% more maintenance work 
than their male counterparts, and that female 
postdocs produced significantly more ‘internal 
papers per year. But based on that productivity 
they were only one-third as likely to be allocated 
conference talks as their male peers, she claims 
(http://arxiv.org/abs/0804.2026). 

Conference presentations are critical to a 
young particle physicist’s career. Papers from 
collaborations such as DZero have hundreds 
of authors in alphabetical order. Being given 
the chance to present results at a meeting is a 
major way for young researchers to stand out. 
“Tt’s important,’ says Pauline Gagnon, a physicist 
with the ATLAS detector at CERN near Geneva, 


Switzerland. “Being able to give 
talks is a way of rewarding indi- 
viduals for their work” 

Most particle detectors have 
internal committees that allo- 
cate conference presentations 
to researchers. These commit- 
tees are frequently male-domi- 
nated, and Towers believes this 
lies behind the discrimination. 
“I dont think for a second that 
there is a conscious bias going 
on,’ she says. But the commit- 
tees “are in danger of being 


Sherry Towers assessed work data 
for an experiment at Fermilab. 


address the problem in its own 
speakers’ committee. 

After Towers complained, 
Fermilab launched an inter- 
nal review in autumn 2006, 
says Bruce Chrisman, the 
lab’s chief operating officer. 
An edited copy of the review 
obtained by Nature found that 
the collaboration “followed 
its policies correctly”. But the 
investigator, a senior female 
physicist, added that com- 
plaints of gender discrimina- 


prone to patronage and crony- 

ism”. Male committee members are more likely 
to nominate male protégés to receive presenta- 
tion time, she claims. 

Some are sceptical of the findings. “I wasn’t 
convinced that the effect she has found is real; 
says Kevin Pitts, a particle physicist at the Uni- 
versity of Illinois at Urbana-Champaign. Inter- 
nal papers are not necessarily a direct measure 
of productivity, he argues, and the small number 
of physicists surveyed is not enough to prove 
systematic bias. But Pitts is quick to add that he 
has little doubt that females do suffer gender dis- 
crimination: “In fact,” he says, “T have personally 
observed this on more than one occasion.” 

Female physicists contacted by Nature said 
Towers’ data matched their personal experiences 
of institutional sexism in physics. “You often seea 
young guy with an older guy gossiping and having 
coffee, but never a woman,’ says Freya Blekman, a 
physicist on the CMS experiment at CERN. “I'm 
convinced,’ agrees Gagnon. “There is absolutely 
no shadow ofa doubt in my mind.” She says the 
ATLAS collaboration is thinking about how to 


tion in the group “should not 
be summarily dismissed”. There was a general 
feeling that females were being “passed over” 
for leadership roles, the report says. 

DZeros leaders counter that bias, if it ever 
existed, is not plaguing the current collabora- 
tion. A survey of data between August 2006 and 
2007 showed that women gave 17% of all talks 
despite making up just 12% of the collaboration, 
says DZero spokesman Dmitri Denisov. 

Powers says the investigation didn't focus 
on postdocs and hasn't led to real changes at 
DZero. She wants the conference allocation sys- 
tem to be made more transparent and balanced. 
“The changes that need to be made are simple,” 
she says. “It wouldn't cost them a dime.” 

And Towers says gender discrimination ulti- 
mately forced her out of particle physics. She 
adds that in 2004 her former employer, a promi- 
nent northeastern public university, tried to ter- 
minate her contract after she complained that 
she wasnt given adequate maternity leave. She 
has since filed a lawsuit against the university. m 
Geoff Brumfiel 


M. JONES 


Italian group claims to see dark matter — again 


Physicists in Italy claimed last week to have seen 
particles of dark matter. Their announcement has 
got their rivals riled and raises questions about 
what constitutes evidence of a new particle. 

Rita Bernabei of the National Institute of 
Nuclear Physics in Rome presented her team's 
latest results on 16 April at an international 
meeting of particle physicists in Venice, Italy. 
Their detector, DAMA/LIBRA (Dark Matter Large 
Sodium lodide Bulk for Rare Processes), located 
deep under the country’s Gran Sasso mountain, 
seems to be observing dark matter, Bernabei says. 

Most agree that the experiment is picking up 
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something: “They're seeing a signal, there's no 
doubt about that,” says Tim Sumner of Imperial 
College London. But despite this, critics say 

that they don't believe the detector has found 
the elusive particles. “For me, it's not proof that 
they have seen dark matter,” says Gilles Gerbier, 
a physicist at the Centre for Atomic Energy in 
Saclay, France. He adds that he’s stumped by 
what's causing the signal. 

Dark matter is believed by most physicists and 
astronomers to make up some 85% of the matter 
in the Universe. Most theories predict that it is 
some form of massive particle that interacts very 
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rarely — if at all — with regular matter such as 
atoms. To date, most believe that dark matter 
has been spotted only indirectly via its pull on 
rotating galaxies and its effect on the shape of 
the early cosmos. 

It's not the first time that Bernabei’s team has 
made this claim. In 2000, they also claimed to 
have directly observed dark matter. The team uses 
ultrapure sodium iodide crystals, which theory 
predicts will give off flashes of light when they 
are struck by dark-matter particles. After several 
years of collecting data in the late 1990s, the 
group Saw an increase in the number of flashes 


GUT REACTIONS 
Analytical technique shows 
how metabolism varies 
between populations. 
www.nature.com/news 


Swiss ‘dignity’ 
When it comes to the ethics of experimenting 
on living subjects, plant biologists have had 
cause for a certain smugness. But perhaps no 
longer in Switzerland. 

The Swiss federal government's ethics 
committee on non-human biotechnology 
has mapped out guidelines to help granting 
agencies decide which research applications 
deeply offend the dignity of plants — and 
hence become unfundable. 

Although most people might be 
bewildered that a discussion on how to 
define ‘plant dignity’ should be taking place 
at all, the stakes for Swiss plant scientists 
are high. The Gene Technology Law, which 
came into effect in 2004, stipulates that ‘the 
dignity of creatures’ should be considered 
in any research. The phrase has been widely 
criticized for its general woolliness, but it 
indisputably includes plants. 

All plant biotechnology grant 
applications must now include a paragraph 
explaining the extent to which plant dignity 
is considered. “But scientists don’t know 
what it means,’ says Beat Keller of the 
Institute of Plant Biology at the University 
of Zurich who is running the first field trial 
— of disease-resistant corn (maize) — to be 
approved under the new legislation. 

“At the moment not even authorities who 
decide on grants know what the ‘dignity of 
plants’ really means,” says Markus Schefer, 

a constitution lawyer at the University of 
Basel and a member of the ethics committee. 
“That’s why we were asked to deliberate.” 

The constitution says that the ‘dignity of 
creatures’ must be taken into account in the 
gene-technology arena, which is why the 
term has been adopted into the regulations. 


aw is threat to plant biology 


Living creatures trapped in undignified conditions? A tricky ethical question for Swiss plant biologists. 


The government called on the advice of 
its ethics committee two years ago to help 
develop a definition for plants. “My first 
reaction was — what the heck are we doing 
considering the dignity of plants,” says 
Schefer. “But this very broad provision exists, 
and we have to help to prevent a legal mire.” 
The committee has created a decision 
tree presenting the different issues that 
need to be taken into account for each 
case. But it has come up with few concrete 
examples of what type of experiment might 
be considered an unacceptable insult to 
plant dignity. The committee does not 
consider that genetic engineering of plants 
automatically falls into this category, but 
its majority view holds that it would if 
the genetic modification caused plants to 
‘lose their independence’ — for example 
by interfering with their capacity to 
reproduce. The statement has confused 
plant geneticists, who point out the contrast 
with traditional plant-hybridization 


technologies, for example in roses, which 
require male sterility, and the commercial 
development of seedless fruits. 

Keller sees the issue as providing another 
tool for opponents to argue against any form 
of plant biotechnology, which is already 
very difficult to conduct in Switzerland. 
Schefer says that things will start to become 
clearer when legal challenges to specific 
research projects come to court, and case 
law becomes established. 

The definition of what constitutes dignity 
in animals is currently being tested in a 
Zurich court. Primate-research projects 
at the ETH Zurich technology institute, 
which involve separating young marmosets 
from their mothers, have been put on hold 
while the court decides if they conflict with 
the animals’ dignity. A ruling is expected 
this year. Whichever way it falls, the 
decision is likely to end up in the federal 
constitutional court. a 
Alison Abbott 


every June and a decrease during December. The 
seasonal oscillation, they claimed, was due to 
Earth's annual motion with and against a stream of 
dark matter swirling around the Milky Way. 

But rival groups failed to see the signal and 
heaped criticism on the group. They demanded 
proof that the Italians’ detection was not caused 
by systematic errors such as natural radiation or 
glitches in their electronics. 

This time around, the DAMA/LIBRA team 
seems to have addressed some of those 
concerns, according to Bernard Sadoulet, an 
astrophysicist at the University of California, 
Berkeley. For example, their new detector uses 
larger, less strongly radioactive crystals, and 
they have proven the stability of their detectors. 


But once again, DAMA/LIBRA's rivals are 
coming up empty-handed. Sadoulet's own 
Cryogenic Dark Matter Search II detector 
announced a null result earlier this year (see 
Nature 452, 6-7; 2008). And other second- and 
third-generation experiments are similarly failing 
to see a signal. If the signal DARMA gets is dark 
matter, Sadoulet says, “it's certainly not the dark 
matter we were looking for”. He also continues 
to criticize the group for failing to share enough 
details of its data. “There has not been enough 
information provided to the community,” he says. 

Bernabei defends her group's finding. Most 
other detectors try to measure direct collisions 
between a dark-matter particle and an atomic 
nucleus, she says. DAMA/LIBRA is the only 
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experiment with the years of data needed to see 
the annual fluctuation in dark-matter particles. 
And the fluctuation matches “all the several 
requirements of dark matter", she says, adding 
that two papers have just been posted on the 
popular preprint server ArXiv, which provide the 
details of her work. 

But unless another team sees something, 
Gerbier says, it's unlikely that the wider physics 
community will accept the Italian claim. “There 
should be different ways of seeing the same 
thing,” he says. “A single experiment cannot 
discover dark matter.” 

Bernabei agrees. Proving her team's findings 
will take “time and patience", she says. |] 
Geoff Brumfiel 


919 


ANATOMICAL TRAVELOGUE/SPL| 


J. RICHARDSON/CORBIS 


Biologists initiate plan 


to map human proteome 


Ambitious plans to catalogue and characterize 
all proteins in the human body — a Human 
Proteome Project — are being drawn up by a 
small group of researchers. But with a price tag 
of around US$1 billion, some question whether 
the organizers can raise enough money or 
momentum for such an undertaking. 

Researchers looked into the idea in the mid- 
1990s as the Human Genome Project was tak- 
ing shape — the human proteome seemed a 
natural successor. However, a coordinated 
effort to index human proteins never emerged. 
One reason is that the scale and complexity of 
the problem proved daunting and nebulous. 
Protein-coding genes in the body can make 
tens of different versions of a protein, and each 
of these can be modified by the addition of 
chemical groups in countless different ways. 
All these proteins are being manufactured at 
differing levels, and at different moments in 
time, in the 200 or so types of human cell. “It 
was thought to be beyond comprehension,’ says 
John Bergeron of McGill University in Mon- 
treal, Canada, former president of the Human 
Proteome Organisation (HUPO). 

Now Bergeron and a group of leading pro- 
teomics researchers are putting together a pro- 
posal for a large-scale assault on 
the human proteome. It would 


the cell each of those proteins is 

located and which other proteins each is inter- 
acting with. (The human genetic sequence, 
by contrast, shows which regions code for 
proteins but not which are actually making 
them.) Proponents say that this type of pro- 
tein catalogue will be invaluable in revealing 
new drug targets or biomarkers to track the 
progression of disease. The cost “is absolute 
peanuts when you consider the importance 
of mapping the building blocks of life’, says 
Mathias Uhlen at the Royal Institute of Tech- 
nology in Stockholm, Sweden, who is helping 
shape the new project. 

Two preliminary workshops have been held 
to discuss the endeavour, most recently in Bar- 
bados in January this year. The group plans to 
consult with the wider proteomics community 
for the first time at HUPO’s Amsterdam World 
Congress in August. 

Those involved in the draft plan say that a 
human proteome project is now feasible partly 
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“It willbe a battle to 


reveal which proteins are convince people this 
present in each tissue, where in is worth funding.” 


because estimates of the number of protein- 
coding genes have shrunk. It was once thought 
that there might be around 50,000 or 100,000, 
but now, just 21,000 or so are thought to exist, 
making the scale of human proteomics more 
manageable. And the group plans to focus on 
only a single protein produced from each gene, 
rather than its many forms. “We got rid of all 
this complexity,’ Bergeron says. “We tried to 
craft a project that would be doable with easy- 
to-track milestones.” 

The plan is to tackle this with three different 
experimental approaches. One would use mass 
spectrometry to identify proteins and their 
quantities in tissue samples; another would gen- 
erate antibodies to each protein and use these 
to show its location in tissues and cells; and the 
third would systematically identify, for each 
protein, which others it interacts with in pro- 
tein complexes. The project would also involve 
a massive bioinformatics effort to ensure that 
the data could be pooled and accessed, and the 
production of shared reagents. 

Bergeron envisages the work being div- 
vied up between labs around the world. He 
says that the first stage of the project — which 
involves amassing existing mass spectrometry 
proteomic data — would take 
around six months, and that 
this would be followed by a pilot 
project lasting one to three years 
to do more comprehensive anal- 
ysis of all the proteins manufactured by chro- 
mosome 21, the smallest human chromosome. 
The whole effort could take a decade. 

“It’s a huge undertaking for HUPO and 
they've never done anything like it” says Paul 
Tempst, an expert in proteomics at the Memo- 
rial Sloan-Kettering Cancer Center in New 
York. The organization has run several large- 
scale proteomics efforts, such as those to cata- 
logue the proteome in human blood plasma, the 
liver and the brain. But results from the Human 
Plasma Proteome project and other proteomics 
efforts showed that different laboratories — and 
even the same lab — often identify very differ- 
ent sets of proteins from exactly the same sam- 
ple. “It told us that there were lots of proteins 
in plasma and that if you do it in different labs 
without any effort at technology standardiza- 
tion you get different results,” says Tempst. 

However, Bergeron and others say that 
improvements in mass spectrometry tech- 
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Heart proteins seen separated in gel. 


niques have resolved many of the problems 
of reproducibility that have dogged the field, 
and that it is now possible to reliably identify a 
range of proteins if a sample is analysed many 
times. But an additional difficulty comes in 
trying to analyse samples containing proteins 
at wildly differing concentrations, with some 
present in only a few copies. The three-pronged 
approach is designed so that shortfalls in one 
technique can be made up by the others. 
Steven Carr, director of proteomics at the 
Broad Institute in Cambridge, Massachusetts, 
says there is likely to be broad support for a 
large-scale proteomics effort, but much debate 
about how best to do it. Rather than analyse the 
proteome of one chromosome, he says it may be 
better to tackle the proteome of mitochondria 
or the cell membrane because it would reveal 
more about biology and diseases related to 
those structures. “It’s time to think about some- 
thing in a systematic fashion — whether this is 
the project is a different question,” he says. 
Coordination could also be a challenge. 
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Compared with the human genome project, in 
which a handful of sequencing centres did the 
bulk of the work, a human proteome project 
would involve many more labs, with inevitable 
issues about data-sharing and competition. “It 
will be a battle to convince people this is worth 
funding,” says Michael Washburn, director of 
proteomics at the Stowers Institute for Medical 
Research in Kansas City, Missouri. 

Those involved in the budding project say 
that they must now muster support from fund- 
ing organizations such as the US National 
Institutes of Health and the European Com- 
mission — a clear challenge in today’s tight 
funding climate. “It will be a tough sale,’ says 
Sudhir Srivastava, head of the Cancer Biomar- 
kers Research Group at the US National Can- 
cer Institute in Rockville, Maryland. Srivastava 
says that the project may have a better chance of 
success with a stronger focus on diseases such 
as cancer. “You need to show clinical utility of 
the approach before launching a mega-project,” 
he says. 

Helen Pearson 
See Editorial, page 913. 
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Buckyballs give flash a boost 


Flash memory, the workhorse of mobile 
phones and digital cameras, could be 
made more efficient by using buckyballs. 
These spherical fullerene molecules, 
comprised of 60 carbon atoms, would 
allow flash memory to operate at a lower 
voltage and save on power, researchers 
reported last week. 

“We're the first ones trying to borrow 
molecular electronics concepts and 
put them into non-volatile memory,” 
says electrical engineer Tuo-Hung 
Hou of Cornell University 
in Ithaca, New York, who 
led the research (T.-H. Hou 
et al. Appl. Phys. Lett. 92, 

153109; 2008). 

Most desktop computers 
use a type of random access 
memory (RAM) that is lost 
when the power is shut off. Non-volatile 
memory, however, keeps its content even 
without power, and flash has become the 
dominant type. Flash memory holds its 
zeros and ones in a circuit that contains an 
island transistor, insulated by a thin layer 
of silicon oxide. 

To write or erase the memory, current is 
pumped across the barrier, either injecting 
electrons into the transistor or sucking 
them out. The charge stays put — most 
flash memory is guaranteed to last for 
ten years — but electrons eventually leak 
through the barrier. The barrier has to be 
thick enough to prevent leakage, but thin 
enough for current to pass through during 
awrite or erase. 

The necessary current is relatively 
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high, which translates to relatively high 
voltages. Cycle after cycle, the voltage can 
wear down the flash memory circuits. It 
also requires peripheral circuitry — which 
takes up precious space — to boost the low 
voltages available from batteries or USB 
ports. And most importantly, it wastes 
power that could otherwise extend battery 
life. “The major bottleneck of the current 
flash memories is the voltage,” says Hou. 
Enter the buckyball. By adding 
buckyballs to the barrier layer, the Cornell 
engineers create resonances 
that amplify the current during 
the high-voltage write or erase 
phase. So the voltage needed 
during writing or erasing 
is lowered, by an order of 
magnitude or more. “It’s a very 
interesting twist,” says Sanjay 
Banerjee, an electrical engineer at the 
University of Texas at Austin, who was not 
involved in the research. 
Chenming Hu, an electrical engineer 
at the University of California, Berkeley, 
says that the concept is attractive, and 
could help extend the tenure of flash as the 
dominant form of non-volatile memory. 
But he cautions that other types are already 
nipping at the heels of flash, such as 
magnetic RAM, which stores information 
using the polarity of ferromagnetic plates, 
and phase-change RAM, which relies 
on switching between amorphous and 
crystalline phases of tiny glass filaments. 
“The general feeling is something else will 
replace flash,” he says. 
Eric Hand 


molecules such 
as these could 
be used to make 
flash memory 
drives less 
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Spherical carbon 


power-hungry. 
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Arsenic heats up iron superconductors 


A new class of high-temperature superconduc- 
tors has been discovered in a breakthrough 
that once again has the scientific community 
whispering that economically feasible mag- 
netic levitation and lossless energy transmis- 
sion may be possible. 

First discovered in 1911, superconduc- 
tors can conduct electricity with virtually no 
resistance and can maintain a current without a 
voltage being applied. But they initially offered 
little practical application — most of the metals 
used in the early work, such as mercury, lead 
and niobium, had to be cooled to a few degrees 
above absolute zero before they reached their 
‘transition temperature; the point at which 
electrical resistance disappears. 

In 1986, scientists identified the first in a 
family of copper-based, high-temperature 
superconductors. Within a few years, they 
had pushed the transition temperature above 
100 K — past the boiling point of liquid nitro- 
gen at 79 K, an economically important bench- 
mark. A record of 138 K was set in 1995. But 
those temperatures eventually stopped rising, 
dashing hopes that a room-temperature super- 
conductor was possible. 

Interest was rekindled two years ago when 
researchers at the Tokyo 
Institute of Technology 
synthesized a new super- 
conductor based on iron 
rather than copper. The 
material also featured 
oxygen, lanthanum 
and phosphorus, but its 
transition temperature 
was just 4 degrees above 
absolute zero, no better 
than the very first super- 
conductor discovered a 
century before. 

Then this February, the 
same group announced 
an exciting development. 
The researchers had 
replaced phosphorus 
with another pnicogen, 


A hot sandwich: the superconducting iron 
and arsenic filling in this crystal is fed 
electrons from the layers above and below. 


Could economically viable 


magnetic!lévitation be 
back on track? 


ramping up efforts and trying other pnico- 
gens such as bismuth. Every week, scientists 
are posting papers on the preprint server arXiv 
with new proclamations of the material's prop- 
erties. “It's phenomenal, because we've broken 
the tyranny of copper,’ says Paul Canfield, a 
physicist at the Ames Laboratory in Iowa. 

The iron-based family might provide a fresh 
opportunity to engineer superconductors that 
operate at practical temperatures. It also offers 
chemists a chance to finally figure out how 
high-temperature super- 
conductors work. 

Conventional low- 
temperature supercon- 
ductors were explained 
in a Nobel prizewin- 
ning theory developed 
by John Bardeen, Leon 
Cooper and Robert 
Schreiffer. Vibrations in 
the metallic crystal lattice 
squeeze electrons into 
pairs, overcoming their 
mutual repulsion. The 
paired electrons, operat- 
ing in a single quantum 
state, move freely without 
resistance. A secondary 
property of superconduc- 
tors is the blocking out of 


LaO 


arsenic, in the layered 
material and — boom — the transition temper- 
ature shot up to 26K (Y. Kamihara et al. J. Am. 
Chem. Soc. 130, 3296-3297; 2008). Subsequent 
tweaking has already boosted that temperature 
above 50 K. “Weall were surprised; says materi- 
als scientist Hideo Hosono, who led the study. 
The superconductor community is now 
buzzing. A number of groups around the world 
are working with the iron-based material, 
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magnetic fields, which 
leads to their ability to levitate magnets. 

The discovery of the copper-based family 
of ceramic superconductors raised interest- 
ing questions that are still unanswered. “We 
cannot make sense of it with the theoretical 
physics we know,” says Jan Zaanen, a con- 
densed-matter theorist at the University of 
Leiden in the Netherlands. “We're dealing 
with a big mystery.” Theorists strongly sus- 
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pect that magnetism is involved in high-tem- 
perature superconductivity. In certain types of 
metal, the movement of electrons can become 
‘jammed, and the metal becomes an insulator. 
As the electrons sit in place, they spin, forming 
tiny magnets — but with opposite polarities 
that cancel out the net field. Such materials are 
said to be antiferromagnetic. 

Like the copper-based superconductors, 
the new iron-based ones seem to be antiferro- 
magnetic. They also share a planar structure, 
where the superconducting ‘actiom occurs in 
the middle of the chemical sandwich (see dia- 
gram). As magnetism can exist at very high 
temperatures, the hope, barely expressed 
because it’s so audacious, is that an under- 
standing of the mechanism could lead to a 
way of designing custom superconductors, 
ones that work at room temperature. “Maybe, 
maybe, maybe,” says Hosono. 

Much work remains. The researchers have 
so far worked only with superconducting 
powders and need to grow crystals, which are 
more useful for experiments. Copper-based 
superconductors are easy to work with, but the 
new family involves a trickier chemistry, and 
there is the toxicity and volatility of arsenic to 
consider. Last month, a sealed tube of arsenic 
exploded at the Argonne National Laboratory 
in Illinois, when a research group started to 
synthesize the new material for the first time. 
No one was hurt, but some academics wont 
work with arsenic because they don't want their 
graduate students to take risks. 

However, any worries or doubts are, right 
now, outweighed by excitement and renewed 
enthusiasm for high-temperature superconduc- 
tivity. “It's early days,’ says Canfield, “but we've 
already doubled the transition temperature.” m 
Eric Hand 
See Editorial, page 914. 
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Politically correct names 


given to flu viruses 


Names of flu viruses appearing in the scien- 
tific literature have undergone something ofa 
mutation this year. The group of H5N1 avian 
flu viruses, once known as ‘Fujian-like’ has 
morphed into ‘Clade 2.3.4. And its ‘Qinghai- 
like’ cousin is now called ‘Clade 2.2: They are 
part of a complete revision of HSN1 nomen- 
clature recommended by the World Health 
Organization (WHO). 

The system is now more politically correct. 
It avoids the “stigmatizing labelling of clades 
by geographical reference’, according to the 
WHO. In 2006, when scientists assigned the 
name ‘Fujian-like virus’ to a vaccine-resistant 
strain of H5N1 that spread across Asia, China 
objected strongly to the name. Government 
officials argued that the southeastern Chinese 
province of Fujian was tainted by association 
as the virus spread. 

But a WHO official told Nature that discus- 
sions for a revised nomenclature were under- 
way before the spat, and that the “impetus was 
scientific”. “We recognized that naming based 
on geography was probably not a good idea for 
other reasons,’ she says. 

Flu experts say that the new naming recom- 
mendations, which have 
been posted on the WHO 
website and have been sub- 
mitted for publication in 
the journal Emerging Infec- 
tious Diseases, make a lot 
of sense. They replace an 
ad-hoc system, in which 
researchers would invent 
their own names for newly 
isolated groups, with a 
systematic one in which 
names have greater biologi- 
cal relevance. 

H5N1 viruses are clas- 


Bird flu knows no national borders. 


causing outbreaks in more than 60 countries 
from Azerbaijan to Nigeria and Pakistan. 

“T find the geographical naming system rather 
confusing and unspecific; this more precise 
numbering system is far more rigorous,’ says 
Edward Holmes, a flu genomicist at Pennsylva- 
nia State University in University Park. The new 
system does not affect the naming of individual 
H5NI1 virus sequences — held in the GenBank 
database — that make up such clades. These 
will continue to display geographic and species 
data on the sample. Geographic information 
is relevant, says the WHO official, because it 
locates the sample’s place of origin. 

But experts say that the location data in 
GenBank flu names is woefully inadequate 
for serious spatial science, such as analysing 
the spread of virus genotypes or how spread 
relates to other spatial data on land use, climate 
and trade. “There’s nothing more frustrating 
and useless to a phylogeographer than a per- 
fectly sequenced isolate labelled ‘China,” says 
Rob Wallace, an evolutionary ecologist at the 
University of California, Irvine. “With Global 
Positioning System (GPS) coordinates and 
fine enough sampling we would really be able 
to tease out how the virus 
evolves and spreads across 
farms and wetlands.” 

Indeed, not one H5N1 
sequence in GenBank cur- 
rently contains GPS coordi- 
nates, often because those 
submitting sequences have 
no use for such data. “Given 

the importance of avian flu, 
Ive been amazed to find 
out how difficult it has been 
to get information on the 
geographical distribution 
of essential variables,’ says 


sified on the basis of their 

haemagglutinin genetic sequence into ten phyl- 
ogenetic clades — distinct branches of the virus’s 
family tree — and the new system names the 
viral subgroups according to their position in 
the tree. It aims to be more logical. For example, 
Clade 2.3.4 viruses are not restricted to Fuijan 
province, they have caused cases in humans 
in China, Laos, Myanmar and Vietnam. Clade 
2.1 viruses dominate in poultry and humans 
in Indonesia, for example, whereas Clade 2.2 
viruses have the largest geographical spread, 


ecologist Marius Gilbert, of 
the Free University of Brussels. The WHO offi- 
cial agrees that the current sequence-naming 
system is not as useful as GPS coordinates, but 
says that “the WHO doesn't see its business as 
making that sort of requirement”. 

GenBank only introduced fields for latitude 
and longitude in submitted entries in 2005 and 
few scientists have so far used them, except in 
spatially aware fields, such as forest entomol- 
ogy and biodiversity. a 
Declan Butler 
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SHOWBIZ 
NEWS 


Formulaic pop 
R&B diva Mariah 
Carey has somewhat 
surprisingly titled her 
latest album E=MC?, 
not as ahomage to the 
main man Einstein, 
but because it’s 
allegedly short for 
“Emancipation equals 
Mariah Carey times 
two". Somebody please 
teach her some maths 
before she releases 
more music. 


ACTION PRESS/REX FEATURES 


ON THE RECORD 


“Contrary to 

recent press 
reports, NASA 
offices involved in 
near-Earth object 
research were not 
contacted and have 
had no correspondence 
with a young German 


student.»> 


The agency denies accepting help 
from 13-year-old Nico Marquardt in 
working out how likely we are to be 
hit by the asteroid Apophis. 


WORDWATCH 


Nerdic 

Also known as ‘geek speak’, it is 
the fastest-growing ‘language’ in 
the world, adding about 100 new 
terms every year, including the 
verb ‘to Google’. 


SCORECARD 


Moon crops 
European Space Agency 
scientists find that 


plants can grow, without needing 
extra food, in crushed-up rock 
very like the lunar surface. 


Space tourism 
Just as everyone else 
jumps on the space- 


tourism bandwagon, the world’s 
top space-travel company, the 
Russian government, announces 
that it may stop selling space 
voyages in 2010, to save more 
seats for cosmonauts. 


Sources: Mariah Daily Journal, NASA, 
Ananova.com, BBC, The China Post 
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Fake drugs: lessons for the world 


Dora Akunyili is director-general of Nigeria's National Agency for Food and Drug Administration and Control (NAFDAC). 
Since her appointment in 2001, she has led a successful crusade against counterfeit pharmaceuticals, which are 
responsible for millions of deaths worldwide each year. Nature caught up with her last week in Washington DC. 


What is top of your agenda at the moment? 
I want to bring the incidence of fake drugs 

in Nigeria down to single-digit levels. And 

I want to institutionalize our processes and 
systems at NAFDAC so that it is as strong 

as food- and drug-regulatory agencies in 
developed countries. That would not only 
save lives but would help our food and drug 
industries to capture the African market. 


Since you took office the proportion of 
fake drugs in Nigeria has fallen from some 
41% to around 10%. What did you do? 

We achieved that through restructuring 
NAFDAC, repositioning staff for better 
effectiveness, and public enlightenment 
campaigns — alerting people to the fact that 
there was a problem. We also started focusing 
on those countries that supply these drugs 

to Nigeria, especially India and China. We 
put in place Indian and Chinese nationals as 
independent analysts in these countries to 
recertify drugs before they are imported into 
Nigeria. The drug importers pay them. 

We also go anywhere in the world to 
inspect facilities producing drugs, food, 
cosmetics or any NAFDAC-regulated 
product before we register the product. 

And we insist on preshipment information 
before drugs are accepted in Nigeria. We 
must get all information, from which drug 
is coming in what quantity to the final detail 
of the conveyance vessel. And before the 
financial import document is processed by 
Nigerian banks, they must get NAFDAC 
clearance. Above all, we insist on a certificate 
of free sale. This states that before a drug is 
registered in Nigeria it must already be used 
in the country of production. 


What about the drugs that slip through? 
We mop up what is already in circulation by 
sustained, systematic surveillance. We depend 
on reports from the public, from patients, 
from hospitals, even from journalists. 

In 2003, it was Nigerian journalists who 
reported the deaths of children during 
open-heart surgery in a teaching hospital. We 
investigated and found that the drugs used 
during the surgery were substandard. In this 
mopping up exercise of fake products, we 
have closed three different drug markets and 
sealed up shops. 
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Has the problem simply moved elsewhere? 
When Nigeria got too hot for drug 
counterfeiters, they started migrating to 
other West African countries and became 

a big problem to them. We instituted the 
West African Drug Regulatory Authorities 
Network as a platform to work with other 
countries to encourage them in this fight. We 
are also occupying an important position in 
the International Medical Products Anti- 
Counterfeiting Taskforce, for which I am the 
vice-chair. 


Do you get support from the Indian and 
Chinese authorities? 

They are forced to support us. I don’t think 
they actually wanted to support us because 
fighting these counterfeit drugs weakens 
their economy. That is why they have strong 
regulations for drugs meant for domestic 
consumption and little or no regulation for 
drugs meant for export. We have banned 
30 Indian and Chinese companies and one 
Pakistani company from importing drugs 
into Nigeria. We have better support from 
India than from China. 


Do you think that the execution last year 
of the head of the Chinese drug-regulatory 
agency was an effective measure? 

It was not effective. It was a face-saving 
measure. It was because America had 
spoken. If the Chinese government is serious, 
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then who are the workers that worked with 
the man? Who facilitated what he did? No 
chief regulatory officer could do what he did 
alone. There are people farther down the 
ladder. We are still waiting for the Chinese 
government to tell us who they are and what 
they have done to them. 

I believe that another reason they killed 
him is probably because Chinese children 
died. If all of those drugs were for export, 
they wouldnt have killed him. It was because 
their own people had been affected. 


You were the subject of an assassination 
attempt five years ago. How safe do you 
feel nowadays? 

Since 2003 I have, happily, not been shot 
again. But I still experience threats here and 
there and I don’t think they have given up. 
But my security has been beefed up so much 
that it is getting more difficult for them. Right 
now [ have at least eight policemen around 
me all the time and they are well-armed. 


Why has nobody been convicted yet? 

It is a very, very unfortunate situation. After 
the shooting, the government was able to get 
at the people who organized it because before 
it happened they were boasting that after 
December 2003 there would be no NAFDAC, 
and that I would not live to see 2004. So one 
arrest led to another. The case started in 2004 
and lasted over one and a half years with 

58 exhibits and 19 witnesses. And after all 
that, the judge's verdict was that he had no 
jurisdiction to hear the case. The case was sent 
to appeal court. The appeal court ruled against 
the judge and said: ‘you must hear it, you have 
jurisdiction. That was early last year. And for 
over 12 months there has been adjournment 
on that case, after it was sent back to the high 
court. Even last week we were in court and it 
was adjourned for next week. 


Are fake drugs an issue that people in 
Washington should worry about? 

The problem is already here in America. And 
it is one that can swallow the whole human 
race if nothing is done. Resistant strains 

of microorganisms caused by substandard 
antibiotics do not need visas to travel from 
country to country. a 
Interview by Meredith Wadman. 
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Bubble-fusion engineer 
sues other scientists 


Purdue University in West Lafayette, 
Indiana, has sent federal authorities a 
report of its investigation into the work of 
nuclear engineer Rusi Taleyarkhan, Nature 
has learned. 

Taleyarkhan is known for his claims to be 
able to produce fusion by collapsing bubbles 
in deuterated acetone by bombarding 
them with sound waves. His work had 
been subject to allegations of research 
misconduct, which Purdue had been 
investigating. 

Taleyarkhan’s lawyer, John Lewis of Lewis 
and Wilkins in Indianapolis, says that the 
final decision now lies with the inspector- 
general of the Office of Naval Research in 
Arlington, Virginia. The office funded some 
of Taleyarkhan’s research through grants. 
The Purdue report is confidential, but 
Taleyarkhan plans to contact the office to 
challenge the report’s “accuracy, adequacy 
and completeness’, says Lewis. 

The engineer is also pursuing a 
defamation lawsuit, filed last month in 
Tippecanoe County in Indiana against 
scientists who questioned his claims of 
bubble fusion. The defendants include 
Purdue faculty members named in Nature 
articles, which helped trigger an institutional 
review. “The complaint filed is to clear up 
my reputation,” Taleyarkhan says. 


US biodefence agency 
appoints first director 


The body set up last year to coordinate the 
US government's biodefence efforts has 
picked its first permanent director. 

The Biomedical Advanced Research 
and Development Agency (BARDA) is 
to be headed by Robin Robinson, the 
Department of Health and Human Services 
announced last week. Robinson is currently 
deputy director of BARDA’s influenza and 
emerging disease programme, and has 
overseen the development and stockpiling 
of vaccines against the H5N1 avian flu virus. 


Anthrax scares in 2001 prompted Congress to 
boost US biodefence efforts. 


Botanical art gallery opens in London 


This image of the flowering house plant 
Medinilla magnifica by artist Margaret 
Ann Eden is now on display at the Royal 
Botanic Gardens at Kew in London. 

It is part of an exhibition in the 
world's first year-round gallery 
dedicated to botanical art. The 
exhibition features works from Kew 
Gardens’ 200,000-piece collection 
as well as from Shirley Sherwood, who 
has collected images from around 200 
artists. The gardens’ officials say that 
the Sherwood collection is possibly the 
most important private collection of 
modern botanical art in the world. 


BARDA oversees Project BioShield, a 
$5.6-billion effort to purchase biodefence 
countermeasures that has been criticized 
as a giveaway to industry (see Nature 431, 
1023; 2004). It is the culmination of much 
congressional discussion over what to do 
about biodefence after the still-unsolved 
postal anthrax attacks of 2001. 


Research assessment panel 
advised to destroy all notes 


All personal notes made by those 
conducting a massive assessment of 
university research in the United Kingdom 
should be destroyed, the organization in 
charge of the exercise has recommended. 

Last week, the Times Higher Education 
published extracts from a letter written by 
Ed Hughes, head of the team managing 
the Research Assessment Exercise (RAE), 
which determines funding allocations 
for university departments. The letter 
recommended that members of assessment 
panels destroy all personal notes relating to 
their decisions. 

However, the Higher Education Funding 
Council for England, which distributes the 
English portion of RAE-related funds, says 
that minutes of assessment meetings will 
still be published so that decisions can be 
scrutinized. The destruction of personal 
notes, the council says, is to prevent panel 
members being over-burdened with 
requests to publicly release their notes. 


GlaxoSmithKline does deal 
to develop microRNA drugs 


Pharmaceutical giant GlaxoSmithKline has 
negotiated a deal to develop drugs aimed at 
four microRNA molecules that have been 
linked to inflammatory diseases such as 
rheumatoid arthritis and inflammatory 
bowel disease. The agreement, with Regulus 
Therapeutics in Carlsbad, California, 
marks the first time a large pharmaceutical 
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company has announced a collaboration 
that is exclusively focused on microRNAs, 
which silence genes by intercepting 
messenger RNA molecules. 

Unlike small interfering RNAs that 
also work in this way, microRNAs usually 
downregulate expression of many genes 
at once. They have thus been associated 
with diseases involving multiple changes in 
gene expression, including cancer and viral 
infections. 

Regulus will initially earn US$20 million 
from the alliance, and then receive up to 
$144.5 million more for each of the four 
microRNA targets for which it meets drug- 
development milestones. 


UN ruling makes Australia 
an even bigger country 


Australia has become the first country to 
successfully claim an extension of its rights 
to the sea floor. 

Resources and energy minister Martin 
Ferguson announced on Monday that the 
United Nations had extended the area of the 
country’s continental shelf by 2.5 million 
square kilometres. “I’m pleased to announce 
that Australia, the largest island in the 
world, has been dramatically increased in 
size,’ he said. 

The country’s rights have been extended 
beyond the usual limit of 200 nautical miles 
(370 kilometres) in a number of areas, 
including those off the main Australian 
coast and a number of smaller islands. This 
gives Australia exclusive rights to exploit 
any resources on the sea floor within this 
area, including oil and gas. 

“Tt opens up a wealth of exploration 
entitlements,” says Ferguson. “This is 
potentially a bonanza.’ 

Other claims under the United Nations 
Convention on the Law of the Sea have been 
made by countries including Mexico, France 
and Great Britain. Contentious claims over 
the sea floor of the Arctic are expected from 
Russia, Canada and Norway. 
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very hour of Connie Colgan’s day is 

carefully choreographed. Rising at 

5:30 a.m. she prepares for a marathon 

of waking, dressing and feeding her six 
children before chauffeuring the eldest four to 
school. Sean and Conor, 3 years old and 19- 
months respectively, wreak havoc at home as 
their mother tidies, does laundry, cooks and 
then ventures out again to ferry the kids to 
swimming, Irish dance, piano and gymnastics 
classes. Gruelling as her schedule is, Colgan is 
adding another routine — she has enrolled her 
family in an ambitious new health study that 
will rank her brood among the most biologi- 
cally characterized humans on Earth. 

After donating blood samples, from which 
DNA will be extracted and probed at one mil- 
lion locations with gene chips, Colgan’s family, 
and potentially 80,000 other Western Austral- 
ians, will enter a newly built facility where they 
will wind their way through a maze of tests: 
whole body scans, retinal exams, hearing tests, 
muscle strength, respiratory tests and more. In 
all, about 3,500 measurements will be made 
of each participant. Then, every three years, 
they'll do it again. 

The effort, planned to start next year, is 
one of the newest in a series of population 
databases, or biobanks. By storing DNA and 
blood samples as well as the medical and fam- 
ily history of each volunteer, biobanks provide 
a tremendous resource from which research- 
ers have been able to extract risk factors for 
common diseases — such as obesity, asthma, 
depression and heart disease — that are delug- 
ing the healthcare systems of developed and 
developing nations. Genomewide association 
studies and more complex genome interaction 
analyses have begun to reveal ways in which 
these diseases might be treated. 

Beyond probing genes, however, the more 
ambitious of these biobank projects hope to 
account for the effects of diet and the environ- 
ment and even such factors as state of mind. 
With these interactions becoming cheaper and 
more realistic to peruse, biobanking efforts have 
sprung up all over the world. 

They can be pricey, difficult to 
develop, ethically complex to 
navigate and their returns are 
only speculative. But none of 
this has deterred Lyle Palmer, 
the brains behind the efforts in 
Western Australia, from devis- 
ing one of the most ambitious 
and cutting-edge biobank projects to date. 

For Palmer, chair of genetic epidemiology at 
the University of Western Australia in Perth, 
the perfect site for what he calls a “ludicrously 
ambitious” project is Joondalup — a regional 
city north of Perth — where efforts to recruit 
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participants and build excitement about the 
Joondalup Family Health Study have been 
ongoing since 2005. 

Palmer (pictured above in front of the ‘DNA 
Tower’ in King’s Park, Perth) speaks quickly, 
especially when excited. And the prospects for 
epidemiological research in Western Australia 
excite him. The public-health system is one fac- 
tor. It tracks the prescription-drug use ofall res- 
idents. Plus, roughly 40 years ago the Western 

Australia state government 
passed a law that all public 
and private hospitals would 
share patient records with 
the health department. The 
data bank contains births, 
deaths, marriage registra- 
tions, emergency-depart- 
ment diagnoses, surgical 
history, and midwife, mental-health and can- 
cer records. “I don’t know why they made that 
decision but it is a godsend that they did. And 
we are now the beneficiaries,” says Palmer. 

Drug data and medical history ofall the state's 

residents will provide a welcome context for the 
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imminent flood of genetic data. Although the 
state lacks the centuries of genealogy that help 
power biobanks in Iceland or Utah, Palmer and 
his colleague John Bass, who studies health 
informatics at Curtin University of Technology 
in Perth, are cobbling together a genealogy of 
the state’s residents that dates back to 1840. 


Line of volunteers 
Joondalup’s signature will be the scope of 
its medical testing. In planning the project, 
Palmer is pushing to not just build on other 
prospective epidemiology studies, but to blow 
them out of the water. “How about we just look 
at every single body system we can think of 
and take every single measurement?” he says. 
Once the project is in full swing, Palmer's 
colleague Anne Pratt, who is leading the data 
collection, envisions that a new volunteer will 
enter the testing facility every 15 minutes. After 
a quick interview, questionnaire, and collec- 
tion of blood, urine and saliva, each person 
will embark on a colour-coded ‘throughput 
cycle’ that will guide them through various 
tests. Among the tests they might take are 
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dual-energy X-ray absorptiometry scans show- 
ing fat and body mass, brain positron-emission 
tomography scans, multi-slice computed-tom- 
ography scans, cardiorespiratory stress tests, 
hearing tests, and measurements of back mus- 
cles endurance, lung volume and bronchial 
responsiveness, more than 20 lengths, widths 
and circumferences, more than 50 blood chem- 
icals and thousands of other variables. 

The University of Western Australia recruited 

Palmer from Harvard in 2003 to build and lead 
a world-class genetic epidemiology facility 
that would capitalize on the state's untouched 
genetic resources. He admits that “this wasn't 
the most attractive academic job or the most 
attractive from a financial point of view”. But 
the tug of family, he says, and the desire to give 
back to the system that had 
given him a free education, 
led him back to Australia. 
And, if Palmer were going to 
leave Harvard, he was deter- 
mined to create something 
that would rival all other epi- 
demiology resources. 

Palmer describes his plan 
with the uncompromising confidence of a 
luxury-car salesman with no need to oversell. 
He should be practised. He has spent the past 
four years explaining to politicians, scientists 
and companies how troves of untapped medical 
data could be harnessed to make Western Aus- 
tralia the world’s go-to region for gene hunting, 
drug testing and health research. The project 
has secured Aus$150 million (US$140 million) 
from local institutions and other collaborators 
that should carry the project for three years. 


Battle of the banks 

In the past 18 months, there has been some- 
thing of a revolution in gene hunting business. 
“We found more genes for complex diseases 
in 2007 than in the entire history of the field,” 
says Lon Cardon, a statistical geneticist and the 
newly appointed leader of the genetics division 
at GlaxoSmithKline (GSK). 

The discovery boom began as the quantity 
of genomic data hit a tipping point. After the 
human genome project was completed in 
2001, another large project began to generate 
a map of human genetic variation by catalogu- 
ing single nucleotide polymorphisms (SNPs). 
This, in turn, laid the groundwork for the rise 
of genomewide associate studies that were 
geared to look at the frequencies of these SNPs 
in disease populations and make it easier to 
link genes to diseases. Biobanking is the next 
logical step to translate genetic data into clini- 
cal applications, says Cardon. 

Atleast 18 countries have launched or are plan- 
ning population biobanks including Iceland, 


the United Kingdom, Estonia, India, 
Sweden, China, Mexico, Japan, Gambia, 
Canada and the United States. Pretty 
much any tissue collection is a biobank, 
and they vary wildly. Some are cohorts in 
which the volunteers provide a DNA sam- 
ple and are followed up over time. Others 
are snapshots, surveys with a single sam- 
ple and questionnaire but no follow-up. 
Sizes of biobanks range from just a few 
thousand to halfa million recruits. 

DeCODE Genetics of Reykjavik, Ice- 
land, has become synonymous with 
biobanking and gene hunting. It recruits 
participants through Icelandic physicians. 
Volunteers give blood, from which DNA is 
extracted, and the doctors share the diagnoses 

with deCODE. With this 
approach deCODE doesn't 
characterize the physical, 
health or behavioral qualities 
— the patients’ ‘phenotype as 
opposed to genotype — but 
rather relies on the physi- 
cians’ measurements. To date, 
the company has collected 
DNA and blood samples from 120,000 Icelan- 
dic citizens; 95% agreed to allow deCODE to 
use their DNA for any study approved by the 
national bioethics committee. 

The UK Biobank in Stockport, one of the 
largest planned biobanks, intends to recruit 
500,000 adults aged 40-69 years. When 
volunteers visit an assessment centre each 
one gives written consent, completes a 
lifestyle questionnaire, enters a one- 
on-one interview to provide medical 
history, and then undergoes a medical 
examination and sample collection. 

Roughly 20 months ago the Chil- 
dren’s Hospital of Philadelphia in 
Pennsylvania launched the Children’s 
DNA Database. The goal: to collect the DNA of 
100,000 ofits child patients and scan for genetic 
markers associated with cancer, irritable bowel 
disorder, epilepsy or diabetes, for example. 
After just a year and a half, the hospital and its 
29 satellite medical centres have amassed 48,000 
blood samples with approximately 35,000 from 
children and 13,000 from parents. 

“Tt has been very successful,” says Hakon 
Hakonarson, the director of the Center for 
Applied Genomics at the hospital and ex- 
deCODE executive. The response rate is high 
— only 10% of parents decline, and the DNA 
bank only requires a one-time contact with the 
family to collect blood. But if the parents agree, 
researchers can access the child’s electronic 
medical records, which are updated every time 
the child visits the hospital or its satellites. 

Officials at the US National Institutes of 
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Health (NIH) in Bethesda, Maryland, have 
also argued for the establishment of a 500,000 
person biobank (see Nature 429, 475-477; 
2004). Some disagreed, and a number of fac- 
tors have stood in the way, including the lack of 
a national healthcare system and linked data. 
Estimates have pegged the price tag of such a 
venture at around US$3 billion dollars, a fig- 
ure that doesn't look likely to come from NIH’s 
tight budget. 


Regulation and collaboration 
The international consortium Public Popula- 
tion Project in Genomics (P3G) keeps tabs on 
123 biobanks, mostly cohort studies, and tries 
to ensure that members follow basic legal and 
ethical guidelines. One of their major objec- 
tives is to encourage collaboration between 
biobanks and to foster data harmonization 
by standardizing ques- 
tionnaires and research 
protocols. Such collabo- 
ration, however, has its 
downside. “I find many 
of the cohorts designed 
today are very simi- 
lar — same choices of 
phenotype,’ says Tom 
Hudson, scientific director of both P3G 
and the Ontario Institute for Cancer Research 
in Toronto. “Working too closely seems to have 
suppressed innovation.” 

In this respect, isolation may have served 
Palmer well. Hudson, who was a visiting profes- 
sor at the University of Western Australia two 
years ago, says he was struck by the diversity and 
novelty of phenotypes that Palmer's team will 
be measuring. “That’s what impressed me.’ 

In addition to all the physical measurements, 
DNA and blood analysis, Palmer is also collect- 
ing data about lifestyle — health, behaviour, 
work, family, school and community — using 
Internet questionnaires that probe subjects as 
wide ranging as computer use, bullying, noise 
exposure and Australian values. 

Kari Stefansson, the chief executive of 
deCODE Genetics, calls Palmer’s “deliberate 
brute-force phenotyping” excessive. But no one 
has optimized the biobank formula, yet. “It is 
so early in the knowledge of the genome that 
we have to cast a wide net because we don't 
know what we are going to catch,” says Teri 
Manolio, director of population genomics at 
the National Human Genome Research Insti- 
tute at the NIH. “One of the challenges with a 
study like this is not picking what to include 
but what to leave out.” 

DeCODE certainly leads the pack when it 
comes to linking genes, gene variations and 
loci to specific complex diseases. But its ethi- 
cal blunders, say many in the field, have more 
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demonstrated how not to launch a biobank. 
In 1998, the Icelandic government granted 
deCODE a 12-year exclusive licence to a cen- 
tral database containing the health records of 
all Icelanders. This, in combination with 1,000 
years of genealogy and DNA from the Icelan- 
dic populations, would have proved a formi- 
dable tool for hunting disease-causing genes. 
The Icelandic Supreme Court overturned this 
decision in 2003 because such “presumed con- 
sent” rather than informed consent was uncon- 
stitutional. In 1998, the drug firm Hoffmann 
LaRoche, based in Basel, Switzerland, strucka 
US$200 million deal with deCODE for the right 
to develop drugs based on deCODE'’: data. But 
the perception ofa foreign company profiting 
from Iceland’s medical database turned the 
stomach of many citizens. The company tried 
to calm the waters by promising free drugs and 
diagnostics to Icelanders. 
“We feel very strongly 
still today that the people 
who contribute data to our 
discoveries have some tan- 
gible benefits coming from 
it? says Stefansson. 
Other biobank efforts 
have steered clear of these 
informed-consent troubles. Few, if any, prom- 
ise free drugs or diagnostics, fearing, as many 
bioethicists have pointed out, that such lures 
coerce sick individuals and their families. 
Palmer who says that deCODE began witha 
“crazy model” insists that the Joondalup Family 
Health Study has taken adequate measures to 
protect its participants. 

Despite these reassurances, Michelle Kosky, 
executive director of the Health Consumer’s 
Council of Western Australia, says that the par- 
ticipants are still vulnerable to human-rights 
violations. She is concerned that the database 
containing genetic data could be hacked, 
jeopardizing individual rights to income 
insurance protection and life insurance. And 
consent continues to plague many biobanks 
that include children, including the Joon- 
dalup study. “When children turn into legal 
adults, they don't have a say how their DNA 
and medical records are used — they’re gone, 
they're already out there,” says Patricia Roche, 
a bioethicist at Boston University School of 
Public Health in Massachusetts. “Do parents 
really have the right to do that?” 


If you build it, will they come? 

“Change the world” is the promotional motto 
emblazoned on the orange rubber brace- 
lets that Joondalup-study participants wear, 
reflecting Palmer’s belief that the discoveries 
will improve health around the globe. And, 
just as deCODE put Iceland on the map for 
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biomedical research, creating local jobs and 
opportunities, he expects that the Joondalup 
study will do the same for Western Australia. 
He plans to do this by holding the data in what 
he calls a “charitable trust’, with about ten man- 
aging academic and government institutions 
controlling who can use the biobank’s services 
and funnelling the profits back into the national 
health system. Western Australians will handle 
the studies and run trials for collaborators. No 
data, or access to raw data, will be sold. 

Unappealing as that may sound to the phar- 
maceutical industry, the arrangement seems 
to work, says Eric Schadt, director of genet- 
ics at Rosetta Inpharmatics 
—a subsidiary of Merck — in 
Seattle, Washington. Schadt 
collaborated with deCODE to 
probe the genes that underlie 
obesity and says that the com- 
pany was “extraordinarily cau- 
tious” about the protecting the 
privacy of Icelanders and only gave analysed 
results back to Rosetta. “I don’t think you 
need your hands all over the data to get value 
out of them,” says Schadt. If, like deCODE, 
Joondalup provides “high-value, scientifi- 
cally solid results” that withstand scientific 
scrutiny, Schadt thinks that “most companies 
would be happy with that”. 

They seem to be. The US Healthcare IT 
provider Cerner and IBM are already part- 
ners, and Merck, GSK and AstraZeneca have 
all been eyeing Joondalup. Dan Burns, senior 
vice-president of pharmacogenetics at GSK 
speculates that the Joondalup study will place 
gene-disease associations in context: “There 
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“Finding a geneisa 
very important step 


along the way. But it 
is only a step.” 
— Dan Burns 


are plenty of examples 
of genes or genetic risks associated with 
diseases. But we don’t know what that means, 
so the game is rapidly moving to function” 

DeCODE led the way, says Burns, but it “got 
started around this belief that the goal was to 
identify the gene and I think the whole com- 
munity has matured in its thinking. Finding a 
gene is a very important step along the way, but 
it is only a step?” 


Power in the numbers 

Joondalup, says Schadt, has the potential to 
be more powerful than deCODE. The key is 
in the in-depth phenotyping. 
“Why haven't we seen 15-20 
highly replicated [genomewide 
association study] results for 
obesity like we have seen for 
diabetes? The reason, I think, is 
that it is an incredibly complex 
disease involving so many dif- 
ferent parts of the system that the only way to 
really get a handle on that is to partition popu- 
lations based on phenotype.’ The more pheno- 
types, the easier it is to stratify populations into 
subtypes and to get a handle on more complex 
diseases such as asthma or obesity. 

Phase 4 clinical trials — post-marketing 
safety surveillance — will also be possible, 
says Palmer. “We know what drugs every- 
one has been on for the past 20 years, and we 
will have characterized them more carefully 
than any population has been characterized. 
So we can look for the subtle effects of two 
drugs interacting” That's going to appeal to 
pharmaceutical companies hoping to avoid 
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debacles such as the one over Vioxx. 

The greatest challenge that Palmer 
faces, says Troy Pickard, mayor of the City of 
Joondalup, is achieving the high participation 
rate. Low recruitment can stunt a biobank at 
its root. Lower-than-expected participation 
during the pilot phase caused concerns for the 
UK Biobank, although they say they are nowon 
track to meet their goal by 2010. In Joondalup, 
Pickard says, the community is keen to partici- 
pate and with good marketing Palmer should 
meet his goals. 

“Lyle is a very motivating guy,’ says Hud- 
son, “his drive is going to make this happen.” 
With his three years worth of funding in hand, 
Palmer is anxious to begin the study. But, 
the project is in limbo until the participating 
institutes can agree on intellectual property 
and governance. “Whenever you get lawyers 
involved there are delays,’ says Palmer, with a 
note of weariness that seems at odds with his 
upbeat demeanour. 

Palmer admits that every day, he wakes up 
and expects the whole project to fall apart, 
but somehow it continues. Colgan is also anx- 
ious for the study to begin. For a woman most 
definitely in control of her family’s day-to-day 
schedule, health represents the only unknown. 
Colgan says that she knows which dietary and 
lifestyle choices are right for her family, but she's 
concerned about all the genetic factors that 
might be beyond her control. She says she'll do 
anything to ensure the best health for her chil- 
dren. As for all the testing? She says “the kids 
will probably like it”. a 
Bijal Trivedi is a freelance writer based in 
Washington DC. 
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closed them down in 2004, saying that they 
were no longer needed. 

Another approach to ELF submarine com- 
munication was to take advantage of electro- 
jets — currents of charged particles that flow 
through the ionosphere and could act as a 
virtual antennas, transmitting messages to 
submarines. Once this idea was proven experi- 
mentally’ in the mid-1980s, physicist Dennis 
Papadopoulos, then of the Naval Research 
Laboratory in Washington, DC, began trying 
to drum up support for a new facility. 

At the time the Pentagon was shutting down 
over-the-horizon radar sites that had been 
designed to detect Soviet bombers attacking 
the United States — including one in Gakona, 
an ideal location because it is underneath an 
electrojet. So Papadopoulos, who is now at the 
University of Maryland in College Park and has 
served asa scientific adviser for HAARP since 
the project's inception, argued for building an 
ionospheric heater there. The facility would 
help the Navy to study ELF waves, it would pro- 
vide scientists with an ionospheric heater and it 
would guarantee continued life for the military 
site in Alaska, something that Alaskan Senator 
Ted Stevens, famous for steering congressional 
dollars to his home state, also liked. “That,” says 
Papadopoulos, “was the genesis.” 

But even before construction began, people 
started to speculate about what the facility could 
be used for and why it was being built. Ina news 
conference in 1990, Stevens talked about bring- 
ing energy from the aurora borealis “down to 
Earth so it could be used” to solve the world’s 
energy crises, earning him the mockery of phys- 
icists. Others such as Nick Begich, the son of 
another Alaskan lawmaker, began claiming that 
HAARP was really intended as a missile defence 


The HAARP facility includes 180 antennas. 


weapon. According to Papadopoulos, these 
claims, although far-fetched, were based ona 
sliver of truth: Bernard Eastlund, a consultant 
to one of the firms building HAARP, had filed 
a series of patents making extraordinary claims 
that HAARP-like technology could be used as a 
defence shield by transforming natural gas into 
microwaves, which would knock out incoming 
Soviet missiles. The idea, jokingly dubbed the 
“killer shield’, was even reviewed by the JASON 
defence advisory group, but was dismissed as 
“nonsense’, according to Papadopoulos. 


From annihilation to defence 
With the breakup of the Soviet 
Union, submarine communi- 
cations no longer seemed as 
crucial, and HAARP needed 
a new raison d’étre. Support- 
ers proposed new tactics, such 
as studying ELF waves’ ability 
to map out underground bun- 
kers like those found in North 
Korea, a goal that quickly drew 
scepticism. 

After the terrorist attacks of 9/11, however, 
the military found a new use for HAARP. In 
2002, a panel headed by Anthony Tether, the 
director of DARPA, recommended that the 
facility be used to study ways to counter the 
effects of a high-altitude nuclear detonation, 
which would release energetic electrons that 
could cripple low-Earth satellites. 

Electrons are produced naturally in this 
region when the solar wind, a stream of ener- 
getic particles flowing from the Sun, slams into 
the magnetic envelope that protects Earth. The 
planet has its own self-cleaning mechanism to 
rid itself of the particles: it eventually dumps 
them lower into the atmosphere through natu- 
ral auroras and lightning. Scientists are now 
looking at whether they can accelerate this 


HOW HAARP 
WORKS 


The facility's transmitters 
send radio waves upwards 
into the ionosphere, between 
100 and 350 kilometres in 
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"Scientific research 
to better understand 
Earth's ionosphere 


is a worthwhile 
endeavour.” 
— Philip Coyle 
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process by creating ‘whistler’ waves, which 
would kick the electrons into low enough alti- 
tudes — around 100 kilometres — where they 
would rain out naturally. 

No one knows for sure whether it will work. 
“Tt is what we call a data-starved area — theory is 
ahead of actual observations,’ says Paul Kossey, 
HAARP’s programme manager at the Air Force 
Research Laboratory at Hanscom Air Force 
Base, Massachusetts. Several experiments are 
being done to look at this possibility. Stanford 
University in Palo Alto, California, for example, 
is involved in the One Hop Experiment, which 
uses HAARP to inject very-low-frequency waves 
into the magnetosphere to create whistlers. The 
investigators use a buoy and ships in the South 
Pacific, where the waves fall 
back to Earth, to measure the 
presence of whistler waves’. 

Mitigating the radiation from 
an atmospheric nuclear detona- 
tion would require an entirely 
new facility, and the technology 
would be daunting. In 2006, a 
New Zealand-led group of scien- 
tists published a paper’ arguing 
that any attempt to remediate radiation could 
lead to worldwide blackouts of high-frequency 
radio waves, disrupting communications and 
navigation. And some say that countering such 
high-altitude nuclear detonations is simply 
unrealistic. “I think scientific research to better 
understand Earth’s ionosphere is a worthwhile 
endeavour,’ says Philip Coyle, a former associate 
director of the Livermore laboratory who served 
as the Pentagon's chief weapons tester during the 
administration of President Bill Clinton. But, 
he adds, they don't know how much energy 
they would need to flush the electrons, or how, 
ultimately, injecting this much energy would 
change the ionosphere. 

In the meantime, there are plenty of straight- 
forward science questions for HAARP to look 
into. The ionized part of the atmosphere has 
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long captivated researchers, going back to the 
days of Nikola Tesla, who dreamed of using it to 
send electricity around the world. In 1933, sci- 
entists found that changing the electron density 
in the ionosphere could alter the propagation 
of radio signals*. That discovery eventually led 
to the development of ionospheric heaters to 
study these and other effects. 


Bells and whistles 

Radiation from solar flares is one area of inter- 
est. “These things are really important because 
it is the radiation coming off the Sun that is the 
main cause of satellite failure or potential death 
in human space exploration,” says Michael 
Kosch, the deputy head of the communication 
systems department at Lancaster University, 
UK. Other areas include looking at the proc- 
esses that cause an aurora — when electrons in 
the magnetosphere collide with the uncharged 
particles of the atmosphere, creating the opti- 
cal emissions often seen as brilliantly coloured 
lights in the night sky. One of HAARP’s most 
cited accomplishments is the creation of the 
first artificial aurora visible to the naked eye’. 
On zapping the ionosphere, HAARP created 
a green aurora between 100 and 150 kilome- 
tres high — in the middle of a natural aurora. 
“That was something you couldn't predict,” 
says Michael Kelley, a physicist at Cornell 
University in Ithaca, New York, who has been 
involved with HAARP. 

Other ionospheric heaters around the world 
include a lower-power US facility in Arecibo, 
Puerto Rico, which has been offline since a 
flood several years ago (although plans are 
under way to refurbish it), and one in the Rus- 
sian city of Vasilsursk, which has struggled 
with funding issues. HAARP’s closest peer isa 
powerful ionospheric heater at the European 
Incoherent Scatter (EISCAT) Scientific Asso- 
ciation in northern Scandinavia. EISCAT’s 
heater has cost roughly $24 million to build 
and operate to date, and was the first to create 
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an artificial aurora, even before HAARP. 

HAARP, though, has the highest power as 
well as the most advanced optics and diagnos- 
tic equipment. But most of all, its phased-array 
radar means that the signals can be steered and 
controlled digitally. It can also create multiple 
beams, which can be shaped, 
or changed instantaneously to 
sweep north, south, east and 
west. “I think the main thing 
that makes it unique is that it 
has a much wider frequency 
operating range,” adds Kosch, 
who hasalso worked extensively 
at EISCAT. HAARP operates 
between 2.8 and 10 megahertz, 
whereas EISCAT operates 
between 3.9 and 8 megahertz. “It can operate in 
a much lower frequency range than the one we 
can use here in Europe,’ Kosch says. 

As HAARP was only finished in 2007, sci- 
entists and Pentagon officials involved in the 
project concede that management issues, such 
as allocating time at the facility, are still in the 
formative stages. In fact, one of the most recent 
HAARP experiments is something that’s not 
likely to show up in the scientific literature 
at all: an experiment done in January that 
involved sending radio waves to the Moon and 


Scientists want to better understand the processes involved in creating auroras. 
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“HAARP can operate 
in amuch lower 
frequency range than 


the one we can use 
here in Europe.” 
— Michael Kosch 


then having amateur radio enthusiasts and a 
receiving antenna in New Mexico measure the 
reflected signals. But Papadopoulos says that 
the experiment was more for the amateur radio 
community than for scientists. 

At the moment, time at the facility is divided 
between researcher-directed work, which takes 
place during ‘campaigns’ of two to three weeks, 
and military needs. “It’s a fairly complicated 
situation in which we support new research- 
ers, and new people, by getting them involved 
in the campaigns, which is relatively cheap,” 
says Kossey. “Then of course we also fund 
[military] proposals and contracts that come in 
under broad agency announcements, in which 
researchers propose research that is of interest 
to the various organizations.” 

And even though HAARP is a military- 
owned facility, academics say that access has 
not been a problem. Umran Inan, the lead sci- 
entist for the Stanford work, says that Stanford 
has been one of the most fre- 
quent users, with numerous 
graduate students and foreign 
scientists working at the site. 
“Obviously, there are security 
arrangements, because it’s a 
US Department of Defense 
facility,’ says Kosch. “ma 
foreigner — escort required 
— but I am already so famil- 
iar to the people there, and so 
familiar with the facility, that it’s not really a 
major problem? 

HAARP’s evolution may not have been 
straightforward, but itis, in the minds of many 
scientists who work there, a success. “HAARP 
has been a boon to science in this area, and I 
think the managers that run HAARP, from the 
very beginning, have involved the community,’ 
says Inan. So unlike many other Department 
of Defense facilities that are built before there 
is a clear rationale, “in this case the commu- 
nity was involved from the very beginning, so 
the properties of the facilities were all defined 
with the involvement of the community. Now, 
I think it’s a thriving success,’ he says. 

As for HAARP’s original legacy, as an 
antenna to send signals to submarines, that 
era has come and gone with the end of the cold 
war. “The communications for submarines is 
not as important any more,’ says Papadopoulos. 
“There are, he acknowledges, “no submarines 
from the other side” a 
Sharon Weinberger is a freelance writer in 
Washington DC. 
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CORRESPONDENCE 


Media right to report small 
value of antidepressants 


SIR — Your Editorial ‘No more scavenger 
hunts’ (Nature 452, 1; 2008) indicates that the 
press reaction to a recent meta-analysis of 
antidepressant efficacy was exaggerated — 
and that it was “not the media's finest hour”. 
But I do not find their reaction surprising. 

To one who has followed the scientific 
literature over the years, the fact that 
antidepressants’ superiority over placebo is at 
best modest may come as no surprise: in fact, 
this is apparent in most published clinical 
trials. But the portrayal of antidepressants as 
advertised by pharmaceutical companies, 
ratified by medical professionals, propagated 
by mass media and absorbed by the general 
public during the past two decades has been 
very different. They are promoted as highly 
efficacious drugs (not true, when considering 
effect magnitude) that correct the ‘chemical 
imbalances’ underlying depression (not 
true, when considering how little we know 
about its pathophysiology) and improve 
depressive symptoms better and faster than 
psychotherapy (not true, when considering 
most clinical trials that compare the two). 
This publicity prompted an even more 
misguided rejection of antidepressant 
therapy by some of its opponents, with 
claims that the drugs would ‘dope’ people 
into happiness and prevent them from 
working out their problems. 

Publicizing the modest effects of 
antidepressants goes some way towards 
countering the much more exaggerated 
positive hype that preceded it. Although it 
might not have been the media's finest hour, 
on this subject it was still finer than most. 
Olavo B. Amaral 
Departamento de Bioquimica, ICBS, Universidade 
Federal do Rio Grande do Sul, R. Ramiro Barcelos 
2600 (anexo), 90035-003 Porto Alegre, RS, Brazil 


Tropical farmers need 
productive alternatives 


SIR — Your Editorial ‘Markets can save 
forests’ (Nature 452, 127-128; 2008) 
proposes integrating deforestation into the 
international carbon market, but it is unlikely 
that money alone can solve the problem. 

The process of tropical deforestation 
presents a dilemma: enormous economic 
value (more than US$2,000 a year per 
hectare) is lost in favour of small private 
benefits (often less than US$100 a year per 
hectare). In a true market, an increased, 
scarcity-signalling price should provide an 
incentive to boost the supply of the scarce 
commodity. So far, this has not worked in 
the case of tropical forests. Real financial 
flows received for the provision of ecosystem 
services are vanishingly small, if they exist 
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at all — only a few beneficiaries have been 
convinced to pay for environmental services. 
This illustrates the essentially theoretical 
character (at least, up till now) of economic 
values generated by ecosystem services. 

‘Payments for ecosystem services are 
supposed to save tropical forests. But even 
ifit were possible to mobilize substantial 
payments, tropical farmers wouldn't be 
prepared to stand by and twiddle their 
thumbs while receiving them. They need 
a field of activity, so sustainable land- 
use concepts should address the social 
environment and needs of people as well. 
These concepts could be linked and possibly 
financed by such payments. 

If the people who use tropical lands are 
disregarded, we don't believe that payments 
for ecosystem services will solve the problem 
of disappearing tropical forests. What is 
needed is an economic system that keeps 
people gainfully employed in an activity that 
is ultimately productive, not destructive. 
Thomas Knoke’*, Bernd Stimm7, Michael Weber} 
*Institute of Forest Management, 

Technische Universitat Minchen 

tInstitute of Silviculture, 

Technische Universitat Munchen, 

Am Hochanger 13, 85354 Freising, Germany 


Researchers should explain 
why they use animals 


SIR — Your News story ‘Animal-rights 
activists invade Europe (Nature 451, 1034- 
1035; 2008) highlights the need for medical 
researchers to do more to communicate 

to the public the reasons why they need to 
use animals in their research and what this 
involves. All too often, there is a tendency 
to wait until extremism becomes intolerable 
before taking steps to counter it (see Nature 
452, 282; 2008). The little information about 
animal research available to the public is 
frequently oversimplified and tends to be 
over-reliant on the perceived authority of 
the author. The scientific literature usually 
requires subscription to access it and 
scientific training to understand it. This 
leaves information gaps through which 
antivivisectionist groups can push their 
propaganda. 

Organizations such as the Research 
Defence Society (www.rds-online.org.uk) do 
much to address this deficit, but have limited 
resources and cannot be expected to counter 
the animal-rights campaigners alone. Anyone 
who is wondering why somebody doesn't 
debunk misleading claims made about 
them or their colleagues should consider 
the possibility that they are that ‘somebody. 
Even those who are not prepared to go public 
can always provide detailed explanations of 
their work and that of others in the field to 
scientific advocacy campaigns. 


©2008 Nature Publishing Group 


A fact your report didn’t mention is that 
the new biomedical laboratory in Oxford 
— which, by the way, will house mostly 
rodents and very few monkeys — has been 
built. In a campaign that complemented the 
efforts of the police and government, Pro- 
Test were able to counter the animal-rights 
group Speak (“The voice for the animals’) by 
capitalizing on the overwhelming support for 
the new laboratory among Oxford students 
and local politicians. 

Extremism can be defeated, but only if 
scientists stand up and expose the myths and 
distortions that fuel it. 

Name and address supplied 


Truth about a plant with 
many names 


SIR — In the portrayal in Books & Arts of 
the University History Museum in Pavia, 
northern Italy (Nature 451, 526, 2008), the 
naturalist Giovanni Antonio Scopoli — one 
of Pavia’s many famous professors — is linked 
with the drug scopolamine. But although 
the two are connected, the compound was 
in fact named after the plant from which it is 
derived, Scopolia carniolica. 

Indeed, Scopoli (1723-1788) was also a 
physician. He worked for a time at the huge 
500-year-old mercury mine in Idrija in 
Carniola, now part of Slovenia, and recorded 
the adverse effects of mercury in miners. In 
this mountainous region, he studied the local 
plants and published his findings in the 
famous Flora Carniolica in 1760 and 1772, 
and corresponded in Latin with the Swedish 
naturalist Carl Linnaeus. 

The scopolamine-containing plant was 
first described in 1569 by Pietro Andrea 
Mattioli and was identified by Scopoli in 
the forests around Idrija as Lithophila. The 
Vienna court botanist Nikolaus Joseph von 
Jacquin in 1764 changed the name to Scopola 
carniolica, in honour of the great naturalist. 
However, Scopoli avoided using the name, 
as ‘scopola’is an insult in Venetian dialect, 
indicating a slap in the face. Linnaeus 
changed the name to Hyoscyamus scopolia 
in 1767, and in 1790 the genus name 
Scopolia was adopted. 

About 100 years later, Ernst A. Schmidt 
at the University of Marburg isolated 
an alkaloid constituent from Scopolia 
carniolica’s dried rhizome, naming the 
drug scopolamine. This sedative has also 
been used by ophthalmologists and as a 
‘truth seruny during the Cold War. 

Marko Kreft, Robert Zorec 

Laboratory of Neuroendocrinology-Molecular 
Cell Physiology, Institute of Pathophysiology, 
Faculty of Medicine, University of Ljubljana, 
Zaloska 4, 1000 Ljubljana, Slovenia, and 
Celica Biomedical Center, Tehnoloski Park 24, 
1000 Ljubljana, Slovenia 


Vol 452|24 April 2008 


nature 


COMMENTARY 


Europe's research system must change 


Science funding in the European Union needs to be revised to better serve economic, social and 
environmental goals, Luke Georghiou argues. 


Programmes have become a familiar 

funding source for projects and mobility. 
They account, however, for only about 5% of 
total public-research money’. Since 2000, the 
European Commission has sought to extend 
cooperation beyond its own programmes into 
the remaining 95% of public research, which 
is funded by national governments. It is doing 
so under the concept of the European Research 
Area (ERA) — that we at the Commission's 
ERA Rationales Expert Group have advised’ 
— using measures such as improving cross- 
border access to infrastructures, coordinating 
national research programmes and facilitating 
researcher mobility. 

During the past year a consultative green 
paper has set out ideas for a relaunch of the 
ERA’. This followed some innovation in the 
Seventh Framework Programme, which 
included the foundation of the European 
Research Council (a funding body for inves- 
tigator-driven research allocated through 
Europe-wide competition) and the industry- 
led Joint Technology Initiatives — such as the 
Innovative Medicines initiative for pharmaceu- 
ticals and the Clean Sky initiative for green air- 
craft manufacture and operation — which can 
be worth up to €1 billion (US$1.5 billion) each. 
In essence Brussels has proposed ‘more of the 
same and better’ to make the public-research 
system function more effectively. 

These measures do not get to the core of why 
we need the ERA. They do not reach out beyond 
the research community to appeal to politi- 
cians, business or the general public. There is 
the chance ofa major transfer of resources from 
agricultural subsidies to research and innova- 
tion with a new budget settlement and financial 
framework due for the European Union (EU). 
Having as the flagship policy one that focuses 
on remedies for perceived fail- 
ings in the research system lacks 
ambition commensurate with 
this broader vision. 


Fe researchers in Europe, the Framework 


Radical approach 
We need a shift in thinking from deficit to 
opportunity. We must make a convincing case 
for increased investment in research by both 
the European Union and national govern- 
ments. This will help us attain Europe's eco- 
nomic, social and environmental goals. 
Making that case requires a radically new 


“We should engage 
research with the 
problems that society 
recognizes as central.” 


Coordinated effort: European premiers Angela Merkel and Nicolas Sarkozy confer at the Airbus factory. 


approach to European research. Three key areas 
of action are needed. First, Europe’s research 
system must respond to a series of ‘grand chal- 
lenges. Second, Europe must become more 
research-friendly. This requires major reforms 
for many types of institution and the poli- 
cies that support them. It also means that the 
ERA should extend to private as well as public 
research. Third, Europe's strategic and applied 
research must be re-orientated at a pan-Euro- 
pean level to support the full range of policies 
that member states have agreed. This involves 
the Framework Programme and national pro- 
grammes — coordinated through ERA-NET 
schemes and other instruments — engag- 
ing much more effectively with policy needs 
in areas such as the environment, transport, 
energy, agriculture and health. 

Basic research has an 
important place, and promot- 
ing excellence through the 
European Research Council, 
and building research capac- 
ity through mobility programmes are wor- 
thy goals that deserve more support. But the 
bulk of the Framework Programme budget 
is in strategic and applied research so we 
need to think more deeply about its aims 
and future direction. Officially the budget is 
there to support European competitiveness 
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and public policies. Despite some significant 
achievements, successive evaluations strain 
to provide any overall picture of impact’. 
Existing structures have in general failed to 
provide the kind of linkage that could allow 
research to efficiently support economic and 
social priorities. National delegates return from 
European meetings with their performance 
indicator being the budget share their nation- 
als obtain — ‘juste retour — rather than the 
benefit their country sees from that money. 


Grand challenges 
Historically Europe has been at its best when 
dealing with large projects based on pub- 
lic-private partnerships. These include: the 
development of the Global System for Mobile 
Communications — better known as GSM 
— as the standard for mobile telephony; the 
emergence of Airbus as a global player; the 
ubiquitous use of nuclear energy in France; 
economic leadership in wind energy; and 
scientific leadership at CERN, Europe’s par- 
ticle-physics laboratory near Geneva. Simi- 
larly, in the United States and Japan many 
world-shaping innovations have emerged 
from grand challenges and coordinated 
efforts, such as the atom bomb, spaceflight, 
semiconductors and the Internet. 

Such projects created the conditions for 
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entrepreneurs and individual scientists or 
engineers to seize opportunities. Governments 
provided funding and constructed markets 
through effective regulation and procure- 
ment. Programmes may now need to be more 
agile to keep pace with rapidly moving fields; 
nonetheless the lessons of scale, vision and 
commitment are clear. 

The grand challenges will probably require a 
more direct political appeal to get going. They 
should engage research with the problems that 
society recognizes as central, such as climate 
change, food and energy security and the age- 
ing of western society. For these, the initiative 
will have to come from governments rather 
than business, although many business oppor- 
tunities will emerge as initiatives unfold. 

European governments cannot rely solely on 
the Framework Programme budget; they will 
have to coordinate their national budgets by 
accepting leadership of sub-sections of larger 
programmes, and by funding the participation 
of their own nationals at the very least. The 
challenges must be of a clear trans-national 
nature and require a minimum level of effort 
that cannot be achieved by nations acting 
alone. They must also be feasible: there must 
be a base of research and industrial capability 
to build on and a viable implementation path. 
And of course research must be a necessary 
and important part of the solution. 


Right condition, right coalition 

An exemplary response to a grand challenge 
may have recently emerged. Achieving a low- 
carbon future is arguably the single greatest 
test facing us all. The recently announced 
European Strategic Energy Technology Plan 
(SET-Plan) describes itself as a “far-reaching 
jigsaw of policies and measures”. These include 
binding targets for 2020: a 20% reduction in 
greenhouse-gas emissions; 20% of renewable 
energy sources in the EU energy mix; reduc- 
ing primary energy use by 20%; carbon pric- 
ing; a competitive Internal Energy Market and 
an international energy policy. Central to this 
plan is the need to accelerate the development 
of cost-effective low-carbon technologies. 

The challenge here is great. Energy research 
has stagnated for decades, with total EU pub- 
lic spending falling to a quarter of 1980 levels 
in real terms and substantial declines in the 
private sector as well’. Energy innovation is 
particularly difficult, involving very large 
investments with long lead times and lock-in 
to existing infrastructures. New technologies 
face social acceptance issues and often begin 
by being more expensive than the sources they 
are intended to replace. 

On the other hand, addressing climate 
change offers huge opportunities to develop 
business and employment. With similar invest- 
ment decisions in other major economies 
there is a strong competitiveness rationale. 
The SET-Plan has also developed a series of 
key technology goals for 2050. These include 
second-generation sustainable biofuels, 
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CO, capture, and large-scale commercializa- 
tion of renewable energy, energy conversion 
and efficiency. A high-level multi-government 
steering group will oversee the SET-Plan 
implementation, engaging with all stakehold- 
ers. Among other actions a series of industrial 
projects resembling the Joint Technology 
Initiatives will be launched. The hardest part 
will be securing the necessary finance. 

We need ways to identify and mobilize sim- 
ilar, necessary coalitions of interest in other 
areas. Arguably, the SET-Plan has arisen from 
a once-in-a-lifetime coincidence of practical 
urgency, political will and technological oppor- 
tunity. With grand challenges potentially cost- 
ing €5 billion to €10 billion each, it is hard to 
see Europe affording more than three or four at 
a time even with an increased budget. 

To pinpoint these ways, a new kind of politi- 
cal process is needed that combines top-down 
and bottom-up approaches. 
A bottom-up phase would 
encourage stakeholders to 
form ‘platforms’ to develop 
potential responses to chal- 
lenges. Like the present tech- 
nology platforms, these would have a wider 
base of participation: science; business users 
and suppliers; government policy-makers, 
regulators and purchasers; and where rel- 
evant, non-governmental organizations and 
consumer groups. These would use targeted 
foresight to bring together socioeconomic 
demand and the potential of innovation and 
act as both incubator and lobby. The aim is not 
to follow the now discredited idea of picking 
winners among firms or even technologies (e.g. 
joint programmes for high-definition televi- 
sion that have now been abandoned), rather to 
create a competitive and supportive environ- 
ment in which winning solutions emerge. The 
top-down element will require, at the highest 
political level, a capacity to find resources very 
quickly when a viable strategy has emerged. 
The core budget would come from a fund 
deployed until exhausted*. 


Policy-focused 
There is currently only a very general commu- 
nication between the research carried out at 
European level and the European-level policy 
and regulation setting. This is true both for 
the Framework Programme and for the ERA- 
NETS that have begun to link national funding 
bodies. Indeed it is a problem that besets much 
national applied research. New kinds of coor- 
dination are needed to link users and spon- 
sors of research, at European and at national 
levels. This does not mean a crude customer- 
contractor relationship that often causes 
research to degenerate into consultancy. It does 
mean that regulating bodies, such as the sec- 
toral Directorates-General, will need a greater 
voice in establishing the research agenda and 
greater scientific capability to do so. 

Neither the grand challenges nor the policy- 
focused research can be achieved through the 
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“Europe will be a much 
more effective research 
partner if it can speak 
with one voice.” 


present research system. This is where the new 
ecology comes in. It must consist of reformed 
actors and better linkages between them to 
configure research around these interdiscipli- 
nary challenges’. The long list of reforms that 
are overdue includes: giving greater strategic 
space and autonomy to universities; more 
trans-national peer review to raise quality 
levels; developing a true European market for 
applied research services (cross-border trade 
in applied research accounts for a negligible 
share of a market worth billions); and creating 
a market friendly to innovation through smart 
regulation and public procurement’. 

Europe has to let go of structures and 
approaches that have dominated its research 
funding for decades. First to go should be 
Framework Programmes that are divided 
into large numbers of small, very loosely con- 
nected projects defined years ahead by ‘work 
plans’ with no clear prov- 
enance. These instruments 
may satisfy the clientele they 
fund, but they are almost 
impossible to direct towards 
real problems. The funding 
breakdown needs to be tied to the big- and the 
medium-level challenges that policy dictates. 
Enough flexibility must be retained to respond 
to shifts in demand and to new scientific and 
technological opportunities. Researchers wor- 
ried about losing scarce funding should recog- 
nize that those who can adapt stand to receive 
slices of potentially a much larger cake. 

Europe should start the process of reform 
now. The Commission has a responsibility to 
take the lead and planning of the Eighth Frame- 
work Programme, due to start in 2013, is already 
under way. The grand challenges will not wait 
until then and member states, businesses and 
the scientific community must each play their 
part. The first challenge is one of leadership. 

Those outside Europe who might see this 
discussion as parochial should consider this: 
Europe will be a much more effective partner 
for the United States, Asia and others if it can 
speak with one voice, take the initiative and 
contribute a genuine critical mass to solutions 
to global problems. a 
Luke Georghiou is in the Manchester Institute 
of Innovation Research at Manchester Business 
School, University of Manchester, Manchester, 
M15 6PB, UK. He also chairs the European 
Commission's ERA Rationales Expert Group. 
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Orange revolution 


Stories of seventeeth-century scientists and aristocrats show how Dutch ingenuity benefited England. 


Going Dutch: How England Plundered 
Holland's Glory 

by Lisa Jardine 

HarperPress: 2008. 400 pp. $35, £25 


Harold Cook 
History is often told from a national perspec- 
tive, but big ideas usually have cross-border 
entanglements. Lisa Jardine’s carefully crafted 
and highly readable book describes 
how people and concepts from the 
Netherlands percolated English 
high culture in the seventeenth 
century, influencing early science. 
Going Dutch may unsettle those 
raised on the parochial view of the 
English as driving their own inde- 
pendent destiny. 

Historian Jardine begins with the 
Dutch invasion of England known as 
the Glorious Revolution of 1688-89. 
No blood was shed, yet England was 
subjected to a massive coup detat at 
the point ofa foreign prince's sword. 
The head of the Dutch army, Prince 
William of Orange (also the nephew 
of England’s King James II), gath- 
ered a fleet of more than 500 ships 
to convey his battle-hardened troops 
across the water, an operation the 
size of which was not repeated until 
D-Day in 1944. 

Marching on London, the prince 
was greeted by cheering crowds. 
Meanwhile, James II’s army with- 
drew rather than offering battle. 
A cabal of Protestant lords provided 
political cover by inviting William 
to take over the English govern- 
ment. The imprisoned James II was 
allowed to escape to France, while 


Jardine suggests that a common cause was 
possible between the sometime enemies 
because the countries were culturally close. 
Providing a family history of the English and 
their Dutch first cousins, Jardine explores per- 
sonal networks between influential characters. 
The main vehicle is the Huygens family, includ- 
ing elder statesman Constantijn and two of his 
sons Constantijn and Christiaan, the notable 


Christiaan Huygens's scientific reputation was boosted by English boasts. 


book’s core: the debates about the accuracy of 
Huygens’s pendulum clock for finding longi- 
tude in the 1660s and his balance-spring watch 
of the mid-1670s, and the discussions around 
Robert Hooke’s famous book Micrographia, 
reproducing microscopic biological observa- 
tions in exquisite detail. 

The careful reconstruction of events sur- 
rounding the adaptation of Huygens’s clock 
shows how much he depended on 
the innovations and experiments of 
his English friends. Jardine discov- 
ered new evidence in Samuel Pepys’s 
papers about how sea-trial reports 
to the Royal Society regarding the 
pendulum clock were exaggerated 
by Admiral Robert Holmes. In doing 
so, Holmes, who helped to start the 
Second Anglo-Dutch War, ironi- 
cally gave the Dutchman Huygens 
a claim to priority that obscured the 
contributions of the English. 

Showing the further intertwin- 
ing of Anglo-Dutch intellectual 
networks with those of France, 
Jardine demonstrates how in the 
mid-1660s, Henry Oldenburg, the 
first secretary of the Royal Society, 
tried to raise doubts about Hooke'’s 
Micrographia internationally. This 
helped to undermine Hooke’s repu- 
tation, contributing to the later fail- 
ures of Huygens and Isaac Newton 
to acknowledge his contributions. 
Jardine remains one of Hooke’s chief 
advocates, placing him and Huygens 
on the international stage. 

Jardine’s circles move outwards 
beyond the Huygens family and sci- 
ence to links between the house of 
Orange and the Stuarts, and to the 


a hastily convened Parliament pro- 

nounced William and his wife Mary (daughter 
of James II) as joint sovereigns, giving legiti- 
macy to the new regime. But William’s Dutch 
guard garrisoned an occupied London for years 
afterwards, just to make sure. 

Why was this quiet coup seen as import- 
ing a king rather than suffering a conquest? 
Jardine argues that Dutch victory was subverted 
by English opportunism. By ‘going Dutch’ and 
adopting the commercial and administrative 
methods of their new masters, the English 
quickly gained the upper hand, replacing the 
Netherlands as the major international power. 


scientist. Constantijn Huygens junior accom- 
panied William during the invasion of England, 
whereas Constantijn senior, a long-lived Anglo- 
phile, had served as the principal secretary to 
the house of Orange for many decades before. 
The two chapters concerning the scientific 
work of Christiaan Huygens and Robert Hooke 
are the most original. Jardine emphasizes the 
exchanges between the virtuosi of England 
and the Netherlands that amounted to an 
international scientific forum, even through 
the period of the Second Anglo-Dutch 
War of 1665-67. Two case studies form the 
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lives of English royalists in exile in 
the Low Countries during Cromwell's govern- 
ment. Although this creates the impression that 
both countries were tied by conversations and 
intermarriages among the great and the good 
(genealogical tables are supplied in the book’s 
appendix) rather than by the connections of 
ordinary people, Jardine’s strategy lets her high- 
light many topics without resorting to generali- 
zations. She addresses fine art (mostly painting), 
music, gardening and botany, science and com- 
merce, accompanied by colour illustrations. 
Going Dutch is richly evocative. One feels 
present at a masque in The Hague sponsored by 
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nobles of the Winter Queen's court, with dancing 
until 4 a.m., or walking through the estate gar- 
den of the elder Huygens, or accompanying his 
third son Lodewijk through Somerset House in 
London after the execution of Charles I to view 
the impressive royal art collection. 

This fascinating book is an excellent intro- 
duction to seventeenth-century Anglo-Dutch 
relations. Jardine does not try to summarize 
the whole field. She avoids, for instance, exam- 
ining the Anglo-Dutch wars of the period or 
the bitter rivalries abroad, gives much more 
space to royalists than republicans, scarcely 
deals with religion, and treats the formative 
period of English philosopher John Locke’s 
exile in the Netherlands very lightly. She does 
not develop fully an account of how the Dutch 


coup launched the Bank of England, nor how 
it affected Scotland and Ireland. These subjects 
are left to other works, many of them cited in 
her bibliography. Jardine presents the view 
from England more than that from across the 
North Sea, and her subtitle is perhaps more 
relevant to the eighteenth century than the 
seventeenth. Yet by exploring pertinent exam- 
ples, Going Dutch demonstrates that personal 
connections helped to shape the cultures of 
both countries. a 
Harold Cook is professor at and director of 

The Wellcome Trust Centre for the History of 
Medicine at University College London, London 
NW12BE, UK, and author of Matters of Exchange: 
Commerce, Medicine and Science in the Dutch 
Golden Age. 


Biased brains, messy memories 


Kluge: The Haphazard Construction of the 
Human Mind 

by Gary Marcus 

Houghton Mifflin: 2008. 224 pp. $24 


A Portrait of the Brain 

by Adam Zeman 

Yale University Press: 2008. 256 pp. 
$27.50 


Sandra Aamodt 
Public interest in the brain seems to be insat- 
iable, judging from the many popular books 
about the topic that have been published in 
the past few years. Highlighting the diversity 
of this expanding genre, two books aimed at 
general readers provide views of brain science 
in very different styles. 

In Kluge, psychologist Gary Marcus presents 
a lively tour of the shortcomings of human 
minds and concludes that evolution has left us 
with something of a mess. In an argument remi- 
niscent of David Linden’s The Accidental Mind, 
Marcus makes his case by describing cognitive 
difficulties, including false beliefs, linguistic 
ambiguity, impulsiveness and mental illness. 

The blame, he asserts, rests with our imper- 
fect memory, “arguably the mind’s original sin’. 
Perhaps we would reason more effectively if the 
brain could store and retrieve data as accurately 
and as simply as a computer. Instead we must 
contend with a limited system. Brains locate 
memories by matching them to the current 
context rather than having unbiased access to 
all of our experiences. This contextual depend- 
ence makes it hard during an argument, for 
example, to recall how often our spouse does 
the housework, because thinking of one failure 
inclines our brains to remember similar situa- 
tions rather than contrary examples. 

Many of these problems result from conflicts 
between the brain’s two basic styles of thinking. 
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The reflexive system, having evolved earlier, 
controls most behaviour. It is fast and can accu- 
rately assess statistics — such as the likelihood of 
finding food in certain locations — but is prone 
to overgeneralization and snap judgements. The 
deliberative system, by contrast, is slow, effortful 
and logical, at least intermittently. The reflex- 
ive system readily overrides the deliberative 
system, especially when we are tired or rushed. 
Marcus believes our lives would be improved if 
we engaged the deliberative system more often, 
although he acknowledges that “it often settles 
for reasoning that is less than ideal”. 
Evolutionary psychology has tempted 
many scientists to indulge in just-so stories, as 
Marcus notes; asserting that our brains are 
poorly engineered is an equally risky business. 
Computer memories are more factual than 
those of humans, but computers lag far behind 
the reflexive system on other problems, such as 
distinguishing cats from dogs. Whether any bio- 
logical system (or a computer) could combine 


both sets of virtues remains an open question. 

Marcus closes with experimentally verified 
ways for people to deliberate more effectively, 
a welcome change from the usual self-help 
prescriptions. For example, he suggests imag- 
ining that your decisions might be checked 
by someone else. Although few biologists will 
need to be convinced of the evolutionary argu- 
ments in this book, it remains intriguing to 
contemplate which aspects of our minds could 
be improved and how we might compensate 
for those weaknesses. 

A Portrait of the Brain is constructed with 
equal care, butin a very different style. Neurolo- 
gist Adam Zeman takes on an ambitious project: 
explaining brain function from atoms to neural 
networks to, unexpectedly, the soul. Clearly an 
Oliver Sacks fan, Zeman weaves case studies 
of patients together with basic science, history, 
etymology, classical literature and art to produce 
an erudite discourse on brain components. 

Describing the many aspects of neurons, the 
fifth chapter explains, for instance, how early 
neuroanatomists determined that brains are 
composed of cells, tells the story of a bus driver 
with religious feelings caused by epileptic 
seizures who ultimately became a priest, and is 
laced with asides about scientific rivalry, brain 
development, the hobbies of neuroscientist 
Santiago Ramon y Cajal and how action poten- 
tials are generated. The science is clear, and the 
stories of patients are suspenseful and gripping. 
Yet in some places, the relationship between 
topics feels a bit forced. 

In the final chapter, Zeman grapples with 
consciousness. He outlines how brains that 
are predisposed to tell stories and that attribute 
actions to agents rather than chance might lead 
us to believe in an immortal soul. His own 
view is that this is “no more than a wonder- 
ful fiction”. (Marcus makes the same point less 
gently.) Zeman struggles with science’ failure 
to find an emotionally satisfying replacement 
story, conceding that such questions may be 
more in the realm of art than science. 

What is the impetus for this deluge of brain 


Powers of persuasion: human brain images have piqued public interest in neuroscience and psychology. 
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books? Applications outside of science may 
explain the popularity of some topics, such 
as the potential for behavioural economics to 
illuminate the decisions of investors. Brain- 
scan images themselves may also have ampli- 
fied interest in neuroscience. Both the public 
and the media are drawn to the powerful and 
persuasive visual message of such images. 


Fundamentally, however, people remain 
interested in neuroscience and psychology 
because understanding our brains helps us to 
understand ourselves. | 
Sandra Aamodt is former editor of Nature 
Neuroscience and author of Welcome to Your Brain: 
Why You Lose Your Car Keys But Never Forget How 
to Drive and Other Puzzles of Everyday Life. 


Technological twist on taxonomy 


Systematics as Cyberscience: Computers, 
Change, and Continuity in Science 

by Christine Hine 

MIT Press: 2008. 320 pp. $35.00, £22.95 


Kevin Kelly 

Taxonomy, the science of identification and 
classification of new species, has been one of 
the slowest disciplines to adopt computers. 
When most other scientists routinely use 
these number crunchers to detect patterns 
within large sets of data, why have taxono- 
mists only recently started to use them? 

The reasons are many. Foremost has been 
the subtle variation among closely related 
species, which makes quantification of their 
traits difficult. No computer program can 
outdo the highly refined judgements of a taxo- 
nomic expert who can classify from nuanced 
alterations even the smallest organism. Con- 
sequently, new species are identified and 
described in a manner that would have been 
familiar to Charles Darwin 150 years ago. 

Second, much taxonomic information has 
been, and remains, parochial. The exper- 
tise required for classifying fly parasites has 
little in common with that for fungal species 
or whales. Taxonomic information occupies 
niches — niche being the exact biological term 
for these narrow confines. Specialized niches 
of information with their own protocols 
challenge computerization. 

Third, the low priority given to taxonomy 
has meant it is perennially underfunded. High- 
powered computation and software come low 
on the list after the meagre needs of traditional 
taxonomy are (barely) met. 

Despite these hurdles, the related field of 
systematics (exploring relationships between 
organisms over time) is rapidly transforming 
itself as computation becomes integral. 
In Systematics as Cyberscience, sociologist 
Christine Hine investigates the effects of 
computers and communication technology 
on the taxonomic community. 

Hine’s subjects are primarily the practising 
taxonomists and systematists in the United 
Kingdom, but this community is representa- 
tive of that in the rest of the world. Her con- 
cern is the sociology of change, how these 
tradition-bound disciplines imagine, place 
and incorporate computers into their identity. 
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The Encyclopedia of Life will log facts for all species on Earth. 


Holo me flea mare. 


computation. Classifications based on gene 
sequences are hugely disruptive to the estab- 
lished order of doing things. The focus of 
Systematics as Cyberscience is not rapid genetic 
taxonomy, which could fill its own book, but 
these recent advances only heighten the ident- 
ity shift now under way in taxonomy. 

As taxonomy evolves into the cyber age, it 
is reappearing in the public eye. In February 
2008, systematists launched the first digital 
version of the Encyclopedia of Life (www. 
eol.org), wherein many of the approximately 
1.8 million known species on Earth will be 
represented, each on its own web page. The 
fact that most of these pages are still 
blank does not dilute the mythical 
power of this embryonic encyclo- 
paedia. We can see its promise in the 
30,000 populated pages now in opera- 
tion. Each species page includes pho- 
tographs of the organism in its various 
life stages, the original taxonomic 
description, updatable geographical 
maps of its range, classification charts, 
links to ecological niches, popular 
knowledge of the organism, and many 
other attributes needed to understand 
its biology. 

The most important aspect of this 
emerging cyber library is the way in 
which it facilitates the completion of the 
great taxonomic imperative: to identify 

all of the living species on Earth. There 

is doubt as to how long this will take, 
but it will surely happen only with the 
help of vast communication webs and 
computer technology. 

The future of systematics will 
involve advances in computational 
photography, innovations in creating 
identification keys, pattern-recog- 


For instance, she notices that “the Web and 
the Internet are seen ... as evocative objects 
which bring a certain glamour and an image 
of modernity and accessibility to systematics”. 
Computer technology is not only an aid to get- 
ting work done, and to getting new kinds of 
work done, but also to securing the respect 
and funding needed to do any work at all. This 
strategy plays into a misperception that you 
are not doing real science unless you are doing 
it on computers. 

Asa consequence of digitalization, classi- 
cal taxonomy is put under different strains 
as it adjusts to its new tools. Hine notes that: 
“Traditionally, systematics has worked on 
long time scales, with taxonomists focusing 
on grand life works, and major flora projects 
taking maybe 30 years to complete.” Now, by 
contrast, taxonomy projects based on Inter- 
net databases are offered up in minutes, and 
are never viewed as finished. Instead they 
remain in the ‘perpetual beta’ state of most 
digital content. 

Cheap, ubiquitous genetic sequencing is 
further accelerating taxonomy’s rush into 
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nition software, expert-knowledge 
systems and, eventually, genetic identifica- 
tion gadgets. Soon, the practice of taxonomy 
will be unthinkable without the Internet. Yet, 
as Hine notes, at the moment “systematics 
has been able to portray itself as fundamen- 
tally unchanged by its experience with these 
technologies, stressing instead the way it is 
pursuing its established goals and preserving 
its heritage.” 

The long sweep of evolution breeds in those 
who try to untangle it a great appreciation of 
the winnowing process of time. Taxonomists 
are wary of tossing away any highly evolved 
method of knowledge that has proven itself. 
It is remarkable then to see the speed at which 
these refined practices are being reimagined 
for this digital century. 

Systematics as Cyberscience reminds us that 
change generates strain, conflicting views, 
fear of the new and concern for integrity. But 
we cannot stop this shift that is being played 
out by experts in their respective museums 
and natural history collections. a 
Kevin Kelly is senior maverick at Wired magazine 
and chair of the All Species Foundation. 
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Hidden treasures: the Cajal collection in Madrid 


The perceptive drawings, paintings, photographs and slides of Spain's neuroanatomy 
pioneer record a tale of ambition and rivalry, reports Alison Abbott. 


In 1906, the two Nobel laureates in medicine 
brought their hostilities to the prize cere- 
mony. Italy’s introverted Camillo Golgi used 
his speech in front of the King of Sweden to 
launch a vicious attack on Spain’s Santiago 
Ramon y Cajal and his (correct) theory that 
the brain is composed of billions of individual 
cells, or neurons. Cajal spoke only of his 
scientific observations, but his presen- 
tation rang a deliberate death knell for 
Golgi’ alternative theory that the brain 
was composed ofa continuous web of 
interconnected cells. 

The Spaniard was fire to the Italian’s 
ice. Outgoing and passionate, Cajal 
was a latecomer to neuroanatomy. He 
may have been a teenage rebel, but 
he bent to his domineering father’s 
demands to abandon all notions of 
being an artist, and studied medi- 
cine instead. After his training, 
Cajal joined the military to fight 
for Spain in the Ten Years War in 
Cuba, and came back with malaria, 
which then became complicated by 
a tuberculosis infection. Not sur- 
prisingly, his first scientific ambi- 
tion after joining the staff of the 
University of Saragossa, Spain, in 
1875 was to understand and treat 
such infectious diseases. With the 
money saved from his military service in 
Cuba, he bought his first microscope. 

Spain was well off the beaten sci- 
entific path, but Cajal always tried 
to keep abreast of developments that 
might be useful to microscopy. In 1887, 
when he was already 35 years old, he 
travelled to Madrid to watch a new 
demonstration of a method of stain- 
ing individual nerve cells with silver 
chromate. Known as the black reaction, it had 
been developed in 1873 by Golgi. One look 
down the microscope and Cajal was hooked. 
“All was sharp as a sketch with Chinese ink,” 
he later wrote in his autobiography. 

Cajal dedicated the rest of his life to neuro- 
anatomy. Gone was the sociable and outgoing 
man: no more casino visits, no more social- 
club outings, no more time-wasting with his 
expanding family. He stopped playing with 
colour photography and even gave up paint- 
ing, his first passion. 

In Europe, the race was on to crack the 
brain’s neuroanatomy, and Cajal wanted to 
win it. He improved the resolution of the Golgi 
method, and over the next 30 years he prepared 
and stained thousands of brains from various 
species in different stages of development. He 
acquired the best Zeiss microscope available 
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and, drawing on his artistic talent, precisely 
transferred what he saw onto paper. Often he 
would make a composite drawing of several 
micrographs to present a clearer, very persua- 

sive, scientific picture. 
Following the stained cells from their bul- 
bous bodies up through their long axons and 
dendrites, Cajal described the 


Ways of 
seeing: the 
artistic eye of 
Santiago Ramon y 
Cajal let him see 
brain cells as 
individual, which 
Camillo Golgi 
missed. 


way each axon terminates near the cell body of 
the next. He concluded that the anatomy of the 
brain was indeed, as other scientists were start- 
ing to suspect, based on single cellular units, as 
had been shown for the rest of the body a few 
decades earlier. He further deduced that neurons 
receive information at one end and convey it in 
one direction to the junction with the next cell. 

Cajal also made the first references to the 
brain’s plasticity, a concept that has become 
widely appreciated only in the past 20 years. 
For example, he noted that the brains of peo- 
ple with mental disabilities contain fewer small 
spines on neuronal dendrites, the branches that 
deliver information to neuron bodies. He sug- 
gested that mental activity might influence 
spine number, and that the brain changes shape 
during development. 
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In 18839, terrified that his scientific discov- 
eries would be overlooked, Cajal packed up 
his prodigious collection of slides and draw- 
ings along with his precious microscope and 
embarked on a two-month tour of European 
scientific centres. The tour culminated in his 
attendance at the third meeting of the German 
Anatomical Society in Berlin. His work spoke 
for itself — which was just as well, given 

his weak grasp of foreign 
languages — and won 
over most sceptics to the 
individual neuron theory. 
Except Golgi. The intimi- 
dating Italian made sure 
he was out of town when 
Cajal passed through Pavia 
to visit him. 
Cajal published more 
than 100 scientific papers. 

He also penned popular 

books and articles and ‘how- 

to’ manuals, including one 
that guided young scientists 
in the art of finding a wife. A potential wife 
should be wealthy, he advised, but not extrav- 
agant, and a good housekeeper who would 
shield a budding researcher from distracting 
domestic crises. 

Some members of Cajal’s family have never 
accepted that he left his entire scientific and 
artistic legacy to the institute in Madrid that he 
founded in 1902 (now the Cajal Institute, belong- 
ing to CSIC, the Spanish research council). 
This reaction has stymied all attempts to create 
an independent museum for his collection. His 
3,700 slides, nearly 2,000 exquisite scientific 
drawings, his paintings and photographs, equip- 
ment and other memorabilia are still stored 
at the Cajal Institute. Last year, the Spanish 
government agreed to recognize the collection 
as being of national interest. Full cataloguing 
has just been completed, and a museum site 
should finally be chosen this year. 

In the meantime, the collection can be 
viewed by appointment. While appreciat- 
ing the clarity of the obviously independent 
neurons in Cajal’s drawings, scientists should 
heed a warning from the story of Golgi’s failure 
to see what Cajal saw though the microscope, 
using the very stain that he developed. Golgi 
was a scientific giant in his own right, but 
he blindly suspended reasonable judgement 
when it came to the work of his rival. a 
Alison Abbott is Nature's Senior European 
Correspondent. 


The Cajal collection is at the Cajal Institute, 37 
Avenida del Doctor Arce, 28002 Madrid, Spain 
(http://tinyurl.com/3dk76e). 
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Food alert 


Joshua P. Thaler and David E. Cummings 


The gut prevents nutrient overload during a meal by promoting satiety and enhancing insulin secretion. 
New findings show that nutrients in the gut also activate a neural circuit that increases insulin sensitivity. 


Eating is essential to life, yet its episodic nature 
necessitates physiological adaptations to avoid 
excesses or deficits in circulating fuels, espe- 
cially glucose and lipids’. As the first point of 
contact with ingested food, the gastrointestinal 
tract is ideally positioned to initiate after-meal 
adaptations. Indeed, when nutrients are deliv- 
ered into the gut, homeostatic mechanisms in 
place there are activated so that blood glucose 
levels are perturbed less than when nutrients 
are delivered directly into the blood. An estab- 
lished reason for this effect is that ingested 
nutrients stimulate the release of gut peptides 
called incretins, which enhance secretion of 
the hormone insulin, the main controller of 
blood glucose levels”. On page 1012 of this 
issue, Wang et al.’ describe another gut-medi- 
ated mechanism that contributes to the regula- 
tion of glucose levels: a neural circuit, initiated 
in the intestine in response to nutrient sensing, 
that increases sensitivity to insulin. 

Among the main products of dietary-fat 
digestion are long-chain fatty acids (LCFAs), 
which are cleaved from triglycerides by 
gastrointestinal enzymes. To investigate the 
effect of intestinal LCFAs on glucose homeo- 
stasis in rats, Wang and colleagues used the 
sophisticated method of pancreatic clamping, 
which allows the insulin sensitivity of specific 
tissues to be quantified. They found that infu- 
sion of calorically insignificant amounts of tri- 
glycerides directly into the animals’ duodenum 
(the upper portion of the small intestine) mark- 
edly and rapidly increased insulin sensitivity. 

Insulin not only promotes glucose uptake 
into tissues but also suppresses production by 
the liver of glucose derived from stored fuels; 
both actions lower blood sugar levels. Intestinal 
lipid infusions in Wang and colleagues’ experi- 
ments specifically increased insulin sensitivity 
of the liver, reducing glucose output from this 
organ without affecting tissue glucose uptake. 
In unclamped rats, duodenal lipid infusions 
also contributed to glucose homeostasis, 
establishing the relevance of this mechanism 
to normal physiology. 

How can lipids in the intestine trigger these 
systemic effects? The authors found that, ini- 
tially, an LCFA metabolite called LCFA-CoA 
is sensed by the intestine. (The exact location 
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Figure 1| The intestine's contributions to glucose homeostasis. Blood glucose comes from two 
sources: externally from ingested food, and internally from the mobilization of stored fuels 

by the liver. Following a meal, lipid sensing in the intestine initiates processes that limit both 

of these sources through several mechanisms. For example, the intestine secretes satiation peptides 
such as cholecystokinin (CCK) that promote meal termination, reducing further food intake 

(red pathways). Intestinal incretin peptides augment insulin secretion and also reduce food intake 
(green pathways). Wang et al.’ now demonstrate that LCFA-CoA molecules in the intestine activate an 
intestine—brain-liver neural circuit that enhances insulin sensitivity, suppressing glucose output by 
the liver (purple pathways). In the brain, LCFA-CoA signalling also decreases both production of 
glucose by the liver and food intake. Thick red bars depict inhibition. 
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and identity of the intestinal sensor cells are 
unknown.) They also showed that the link 
between lipid sensing in the gut and insulin 
action in the liver involves an intestine-brain- 
liver circuit within the parasympathetic nerv- 
ous system, a subdivision of the peripheral 
nervous system. The LCFA-CoA signal passes 
from the gut, along the vagus nerve to the 
brain, through the hindbrain, and then back 
down vagal nerve branches that terminate in 
the liver (Fig. 1). Wang and colleagues did 
not elucidate the details of communication 
between intestine and vagus, so intermediary 
roles for gut hormones, including incretins, 
remain possible. 

They found that disruption of any com- 
ponent of this neural circuit eliminated the 
insulin-sensitizing effect of intestinal lipids, 
without affecting baseline glucose homeos- 
tasis. So the intestine-brain-liver axis serves 
not as a basal regulator of insulin sensitivity 
but as a first responder to meals, preventing the 
circulating nutrient excess that would occur 
with profligate mobilization of internal fuel 
stores following a meal. 

Together with previous work, these findings’ 
portray the gut as a rapid-response coordina- 
tor of energy input, influencing both the size 
of meals and the metabolic fate of the ingested 
nutrients. In response to food, the intestine 
produces satiety factors that promote meal ter- 
mination*. The archetype, cholecystokinin, the 
production of which is stimulated by intestinal 
LCFAs, reduces food intake via an intestine- 
vagus-hindbrain pathway (Fig. 1). Furthermore, 
food consumption stimulates gut incretins to 
enhance insulin secretion. Lastly, Wang et al. 
show that LCFA sensing in the intestine also 
increases insulin sensitivity in the liver, mini- 
mizing inappropriate nutrient efflux from this 
organ after meals. So, more than just a calorie 
conduit, the intestine emerges as a neuroendo- 
crine organ that regulates food intake as well as 
insulin secretion and action to improve glucose 
tolerance and to promote a seamless after-meal 
transition from fuel breakdown to storage. 

Wang and colleagues’ observations comple- 
ment previous studies of LCFA signalling in the 
brain. As they do in the intestine, LCFA-CoA 
molecules in the hypothalamus activate neural 
pathways that increase insulin sensitivity in 
the liver, and they also reduce food intake 
(Fig. 1). So LCFA-CoA seem to function acutely 
as a signal of nutrient abundance, triggering 
counter-regulatory responses that originate in 
the brain and the gut to limit further internal 
and external contributions to blood glucose 
levels. Unfortunately, both protective mecha- 
nisms are quickly inactivated by the ingestion of 
a fat-rich diet. For example, Wang et al? found 
that the reduction in liver glucose production 
in response to intestinal lipids ceases after only 
three days of feeding rats a high-fat diet, and 
similar observations were made for the brain's 
LCFA-CoA sensor*®. These results suggest that 
such diets promote obesity and diabetes in part 
by impairing nutrient-sensing systems that are 


942 


designed to restrict food intake and enhance 
insulin sensitivity. 

The apparent fragility of fatty-acid sensors 
in the intestine and the hypothalamus raises 
concerns about their ‘real world’ relevance. 
How important are fat sensors that can be 
overwhelmed by just three days of high-fat 
feeding? One hypothesis for the modern-day 
failure of mechanisms responsible for energy 
homeostasis posits that these systems evolved 
during periods of food scarcity and are thus 
ill-equipped for environments with abundant, 
palatable food. Moreover, the quantity and 
type of lipids in modern diets, typically high 
in saturated fats, might exert different effects 
from those observed by Wang et al.; the lipid 
infusions these authors used contained prima- 
rily polyunsaturated fats. 

A chronic surfeit of dietary fat can also 
reverse the beneficial effects of the acute 
lipid infusions Wang and colleagues studied. 
Long-term exposure to dietary lipids increases 
fatty-acid oxidation, lowering LCFA-CoA 
levels. Moreover, chronic fat intake causes 
insulin resistance through weight gain and 
lipid accumulation in muscle cells. Lastly, com- 
pelling evidence indicates’ that prolonged 
exposure to fatty acids from high-fat feeding 
and/or obesity stimulates inflammatory path- 
ways that cause insulin resistance, perhaps 
overriding the acute insulin-sensitizing effects 
of intestinal lipids. 

The revelation that intestinal nutrient sensing 
increases insulin sensitivity could aid our under- 
standing of how bariatric surgery — operations 
that promote weight loss by modifying the 
gastrointestinal tract — ameliorates diabetes. A 
procedure known as Roux-en-Y gastric bypass 


surgery causes complete remission of diabetes 
in 84% of cases’’, and increasing evidence indi- 
cates"! that this involves mechanisms beyond 
reductions in food intake and body weight. 
This operation, which alters the path of nutri- 
ents through the small intestine, increases the 
secretion of incretins. Wang and colleagues’ 
work raises the untested possibility that com- 
plementary effects on the activity of the intes- 
tine-brain-liver neural circuit might further 
improve glucose metabolism. 

The discovery of incretins led to a new class 
of antidiabetes drugs exemplified by exenatide. 
The LCFA-CoA-stimulated intestine-brain- 
liver circuit also provides potential targets for 
novel antidiabetes drugs and a conceptual basis 
for antidiabetes diets. a 
Joshua P. Thaler and David E. Cummings are in 
the Division of Metabolism, Endocrinology and 
Nutrition, Department of Medicine, Harborview 
Medical Center and VA Puget Sound Health Care 
System, 1660 South Columbian Way, S-111-Endo, 
University of Washington, Seattle, Washington 
98108, USA. 
e-mail: davidec@u.washington.edu 
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Light reined in 


Diederik Sybolt Wiersma 


Light always travels at the same speed ina vacuum, no more, no less. But in 
materials, there's room for manoeuvre: tweak the right material in the right 
way, and exciting optoelectronic properties result. 


In air, light travels at a speed of about 300,000 
kilometres per second. That means it can circle 
Earth more than seven times in one second 
— so stupefyingly fast that we generally con- 
sider the arrival of light rays to be instanta- 
neous. Finding ways to slow down and control 
the speed of light has inspired researchers for 
decades, both because it is fundamentally 
interesting and because it is potentially of 
practical use. 

If you could moderate light’s speed in a mater- 
ial well known for its superior optoelectronic 
properties — thus giving the light more time 
to interact and exchange energy with the mater- 
ial— you would be on to a winner. Writing in 
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Physical Review Letters, Shubina et al.' describe 
how they have done just that in gallium nitride. 
This semiconductor has a wide bandgap such 
that it emits light at blue wavelengths; is easily 
‘doped’ with impurities to create optoelectronic 
interactions; and is mechanically robust, even 
at high temperatures. Of the many uses gallium 
nitride has found in optoelectronic devices, the 
most cutting-edge is in the blue laser diodes 
used to read and write high-density Blu- 
ray storage discs”. This latest advance from 
Shubina and colleagues will aid the develop- 
ment of integrated components for optical sig- 
nal processing, and potentially of detectors and 
solar cells that use gallium nitride. 
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b Resonant crystal 


a Regular glass 


Figure 1| Reshaping at light speed. a, Whena 
pulse of light propagates through a block of a 
regular material with a constant refractive index 
— glass, for example — it is refracted and slightly 
delayed. b, In a strongly resonant material, 

by contrast, specific frequency components 
contained in the light pulse can be accelerated 

or delayed. As Shubina et al.' demonstrate in 
gallium nitride, if these changes act to suppress 
the leading edge (arrows) and increase the 
trailing part of the pulse, the maximum of the 
pulse envelope can be brought to a near standstill. 


Controlling the speed of light is surrounded 
by many apparent mysteries and a fair sprink- 
ling of misunderstandings. Admittedly, slow- 
ing light down until it stands still or even goes 
backwards does seem incredible. And, in the 
other direction, making light pulses travel 
faster than the speed of light in a vacuum is 
sometimes said to violate causality — wrongly, 
as it turns out. 

Both effects are in fact intimately related. 
All one needs to do is bear in mind that a light 
pulse generally contains a superposition of 
light waves of different frequencies. The speed 
at which the whole envelope of the pulse is 
propagating is known as the group velocity’. 
Within this envelope, each individual fre- 
quency component of the pulse travels at its 
own phase velocity, which is determined by the 
refractive index of the material. 

All one needs to do now is create a highly 
dispersive medium: that is, one whose refrac- 
tive index depends strongly on the frequency of 
the passing light. On entering such a medium, 
different components of the pulse will begin 
to propagate at different phase velocities. The 
envelope of the pulse begins to change shape, 
and the top of the pulse might move forwards 
or backwards with respect to the rest (Fig. 1). 
In the first case, the pulse maximum seems to 
propagate at above vacuum light speed; in the 
second case, the effect can be so strong that the 
top of the pulse seems to stand still. But note 
that in the first case causality is not violated: 
the pulse itself, carrier of information, travels 
on as fast as ever. 

A good way to achieve strong dispersion is 


by creating ‘resonant’ interactions that only 
affect light in a narrow frequency range. This 
can be achieved using a material with resonant 
microstructures, or one in which the constitu- 
ent atoms or molecules have a resonant inter- 
action with light of specific frequencies. This 
method has already been used to slow down 
light in crystals*’ and atomic gases®, and 
Shubina et al.’ now use it to put the brake on 
light in gallium nitride. 

In crystals such as gallium nitride, light 
waves at specific frequencies can excite elec- 
trons, creating ‘excitons’ that temporarily store 
the optical energy’. This is strongly resonant 
behaviour — the interaction takes place only 
when the frequency of the light exactly matches 
that of the exciton — and thus strong disper- 
sion is obtained*. When the authors shine 
a short light pulse with the right central fre- 
quency onto their sample, they observe a delay 
and elongation of the pulse, corresponding to a 
more than 100-fold reduction in velocity. 

Shubina et al. discuss two possible mecha- 
nisms that might give rise to this effect, 
although they are not able to give a definitive 
picture of the physics behind their obser- 
vations. First, the interaction of the light 
with the generated exciton might create an ‘ 
exciton-polariton’ that delays the light’s 
ballistic propagation through the material. 
Second, strong scattering effects around the 
exciton’s resonant frequency could delay the 
light waves enormously’*. Such effects have 
been observed in diffusive scattering from 
random distributions of uniform microspheres, 
and become enormous in cold atom clouds’. 

A further, much less intuitive delaying and 
trapping effect is that of optical localization”. 
Here, multiply-scattering light waves randomly 
create standing waves at points throughout 
the crystal. In strongly scattering semicon- 
ductors, the effect can become so strong that it 
brings diffusive propagation to a standstill”; 
it would be fascinating to see if it could be made 
to work in gallium nitride. 

On a more immediately practical level, 
Shubina and colleagues’ gallium nitride light 
‘brakers’ could be applied in optical signal pro- 
cessing to produce delay lines for signal syn- 
chronization or to create short-lived optical 
memories”. This would require a method for 
tuning the resonance over a wider frequency 
window, not just at one specific exciton reso- 
nance frequency. Given gallium nitride’s poten- 
tial for integrating photonics and high-speed 
electronics, the effort would be worth while. 
One can imagine, for example, a photonic crys- 
tal created by periodically patterning a wave- 
guide made of gallium nitride combined with 
other semiconductors, in which the gallium 
nitride would act both as a source of light and 
as delay components ina photonic circuit. ™ 
Diederik Sybolt Wiersma is at the European 
Laboratory for Nonlinear Spectroscopy (LENS) 
and INFM-BEC, Sesto-Fiorentino, 50019 Florence, 
Italy. 
e-mail: wiersma@lens.unifi.it 
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50 YEARS AGO 

Of the many craters on the Earth 
known to have been produced 

by fallen meteors, a few have 

left no signs of the meteor which 
caused them, apart from the 
huge holes created in the Earth's 
crust. Large meteorite fragments 
have been found in or near most 
craters. A possible explanation of 
the lack of meteorite fragments 
in the other cases is that the 
meteors concerned were of 
contraterrene constituency 
(anti-matter). In this case no 
traces of the meteors would 
remain, due to the annihilation 
process. One of these events, 
from which no fragments appear 
to have survived, has occurred in 
recent years. In 1908 a meteor, 
apparently of great size, was seen 
passing over western Russia; 
afew minutes later it crashed 

in the Siberian wastes with an 
explosion, effects of which were 
felt for hundreds of miles around, 
and seismographs thousands of 
miles away registered its impact 
... No expedition to the site of 

its landing was undertaken until 
1927, when the devastation found 
even at 40 miles from the site 
was almost inconceivable; but 

no fragments of the meteor were 
located despite the several huge 
though shallow craters which 

it had caused. If this meteor 
consisted of anti-matter an 
obvious explanation is at hand for 
the devastation found, the lack of 
meteoric material, and the great 
size reported in eye-witness 
accounts of its passage. 

From Nature 26 April 1958. 


O 


100 YEARS AGO 

At the London Institution on 
April 15, Mr Valdemar Poulsen 
lectured on “Telephoning 
without Wires.”... The progress 
made in wireless telephony 

is shown by the fact that 
conversation has been carried 
onacross Denmark from 
Lyngby to another wireless 
telephone exchange at 
Esberg, 170 miles distant. The 
reproduction of the voice was 
clear and distinct, and easy to 
recognise. 

From Nature 23 April 1908. 
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NEUROSCIENCE 


Current views on odour receptors 


Alexander Chesler and Stuart Firestein 


Insects possess refined olfactory systems that use specific receptors on 
their antennae. It emerges that these receptors not only detect odour 
molecules but, unexpectedly, can also act as ion channels. 


All creatures sample their environment for 
chemicals that indicate the presence of food, 
mates, predators, dangers and attractions 
through mechanisms that were thought to 
be evolutionarily conserved from nematode 
worms to mammals. For example, in many 
organisms, odorant molecules bind to their 
receptors on the surface of neurons, initiating 
an intracellular signalling cascade. This ulti- 
mately results in the opening of ion channels 
conveying current, thereby completing the 
transformation of these chemical signals into 
electrical signals. It was thus assumed that the 
olfactory system of flies would function simi- 
larly. But when the fly odour receptor proteins 
were finally identified’°, a new riddle surfaced: 
what are the intermediary molecules that cou- 
ple these receptors to the activation of ion 
channels in insect sensory neurons? Report- 
ing in this issue, Sato et al.* and Wicher et al. 
provide an answer to this question, but in doing 
so raise others that are just as puzzling. 

To detect odours, vertebrates and worms 
use members of a large family of G-protein- 
coupled receptors (GPCRs), which thread 
through the cell membrane seven times. The 
initial identification of odorant receptors (ORs) 
in insects relied on the assumption that these 
organisms also detect odours through GPCRs. 
In 1999, three laboratories identified’? a 
family of these ‘seven-transmembrane recep- 
tors whose expression in the antennae of the 
fruitfly Drosophila melanogaster turned out° to 
control the responses of individual olfactory 
neurons to chemical odorants. 

But the first hint that insects detect odours 
differently from mammals came from analy- 
sis of the receptors’ amino-acid sequences’ ’. 
Although the vertebrate receptors are similar 
in sequence to other known GPCRs, the fly 
receptors are not. Moreover, unlike mammals, 
in which only one gene encoding a receptor 
is believed to be expressed in a given neuron, 
in fly sensory neurons at least one member 
of the receptor family Or83b is co-expressed 
with the insect odorant receptors. (Or83b is 
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an essential co-receptor that does not bind 
to any known ligand, but is necessary for the 
intracellular transport and proper function of 
its companion receptor’.) Finally, and perhaps 
most peculiarly, the insect receptors seem to lie 
in the membrane in the opposite configuration 
to GPCRs — that is, their amino terminus faces 
the cytoplasm and their carboxy terminus faces 
the outside’. 

The observations of Sato et al.* (page 1002) 
and Wicher et al.° (page 1007) may set things 
straight, albeit in an unexpected way. Both 
teams independently find that odour stimu- 
lation of multi-unit OR/Or83b complexes in 
cells grown in culture leads to inward cationic 
currents. Moreover, they show that the ionic 
conductance and permeability of the channels 
carrying these currents are similar to those 
of other known voltage-independent, non- 
selective channels for positive ions. Together, 
the authors’ observations provide compelling 
evidence that, for odour detection, insects use 
an unusual strategy and an unusual recep- 
tor — the ‘receptor is in fact an ion channel. 
However, the two groups*” have remarkably 
different mechanistic views of the sequence of 
events that occur from ligand binding to chan- 
nel opening. 

Through various experiments, Sato et al. rule 
out any role for G proteins. Instead, they con- 
clude that the OR/Or83b complexes include an 
ion-channel function that is directly activated 
by odorant ligands. By contrast, Wicher and 
colleagues’ data implicate a signalling mol- 
ecule — the cyclic nucleotide cAMP — that is 
produced in response to G-protein-mediated 
signalling, which then activates the recep- 
tor/channel. These authors find that cAMP 
is generated in response to odours, and that 
modulators of G-protein signalling can affect 
OR/Or83b function. Most surprisingly, they 
also show that Or83b alone can form a cyclic- 
nucleotide-sensitive channel. 

How can these contradictory findings be 
reconciled? One explanation might lie in 
the timescale of events that the two groups 
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investigated. Sato et al. concerned themselves 
with the initial response that occurs within 
the first second or so after odour applica- 
tion, whereas Wicher et al. focused mainly on 
the cAMP-dependent responses that peak at 
between 30 and 60 seconds. So it could be that 
the opening of OR/Or83b complexes is regu- 
lated by both ligands and cyclic nucleotides, 
but on different timescales: odorants medi- 
ate fast detection by opening the channels, 
whereas cyclic nucleotides provide longer- 
lasting modulation. Although further work is 
needed to tease all this apart, the two studies 
add yet another unexpected twist to our knowl- 
edge of the seven-transmembrane receptor 
family: along with another GPCR channel, 
channelrhodopsin, and the structurally simi- 
lar halorhodopsin ion pump”””, there is now an 
odour-activated channel receptor. 

The coincidental arrival of these two papers 
and the subsequent review process raises an 
interesting set of editorial issues. We acted as 
a referee, and were somewhat involved in the 
process. Separately, each paper presents a con- 
vincing argument for a novel receptor-channel 
protein; taken together, they contain contra- 
dictory data that are difficult to reconcile. Of 
course, this is not the first time papers with 
contradictory views have been published. In 
the 1980s a controversy raged over the iden- 
tity of the intermediate signalling molecule in 
photoreception — calcium or cGMP. No fewer 
than five papers appeared in Nature alone 
describing results that supported one or the 
other candidate, until a pivotal experiment"! 
resolved the issue in favour of cGMP. Although 
journals, editors and reviewers have a responsi- 
bility to vet papers for accuracy and correctness, 
we must also recognize that, at its frontiers, sci- 
ence doesn’t always produce unequivocal data. 
It is no less a responsibility to ensure that the 
literature accurately reflects what is known at 
the time. As George Bernard Shaw (paraphras- 
ing Immanuel Kant) said in a toast to Albert 
Einstein:“Science is always wrong. It never 
solves a problem without creating ten more.” 
Would we have it otherwise? a 
Alexander Chesler and Stuart Firestein are in the 
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Bottom of the top of the world 


The floor of the Arctic Ocean 

comes into sharper focus with the 
publication of an improved version 
of a bathymetric chart first released 
in provisional form in 1999, and as 
version 1in 2001. Accurate mapping 
of the ocean bottom is essential for 
modelling deep ocean circulation, but 
also has a political angle in defining 
the extent of the continental shelf 
—aserious consideration in sucha 
politically sensitive part of the world 
as the Arctic. 

The story behind the improved 
bathymetric chart — IBCAO Version 
2.0 — is told by Martin Jakobsson 
and colleagues in Geophysical 
Research Letters (M. Jakobsson 
et al. Geophys. Res. Lett. 35, 07602; 
2008). Its production is an 
instructive case of new data being 
married to a reinterpretation of old. 

Most of the new data come from 
mapping missions carried out 
since 2000 with multibeam sonar 


equipment aboard various vessels, 


including USCGC Healy, RV Polarstern 


and IB Oden. Multibeam sonar 
systems differ from the sidescan 


systems used, for example, to look at 
the shape of the sea floor or to detect 


wrecks, in providing information 
mainly about depth. 

The more dramatic changes 
to version 2 over version 1 are 
that, as the authors laconically 
put it, the “deep abyssal plains 
are systematically ca. 50-60 m 
deeper...”. The revision stems from 
a metadata analysis of records 
collected by US Navy submarines 
over several decades, which are 
acentral source of bathymetric 
information at high northern 
latitudes in particular. Conversion of 
data for version 1 was based on an 
assumption that the figure for the 
speed of sound in water used for the 


original calculations was 1,500 ms". 


But in many cases the figure applied 


was 1,463 ms '. Hence the change in 
estimated depth, which also helps to 
explain several anomalies evident in 
version 1. 

The three-dimensional views 
shown here are depictions of the 
Alaskan Slope and Northwind Ridge 
before (upper image) and after 
Jakobsson and colleagues’ exercise 
in producing version 2. The image is 
about 650 km across, and the black 
area at the upper left is Alaska; the 
Northwind Ridge is the ‘peninsula’ 
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onthe right. The improved definition 
is evident in the sharper depiction of 
the gullies, caused by erosion, that 
scar the Alaskan Slope. 

The new map is far from the final 
word. The authors point out that a 
near-perfect bathymetric model will 
require comprehensive multibeam 
coverage, which won't be available 
anytime soon. Meanwhile, more 
details on version 2 and derivations of 
it are available from www.ibcao.org. 
Tim Lincoln 


ASTROPHYSICS 


Exhaust inspection 


David L. Meier 


What do you see if you peer into the exhaust of a jet engine larger than our 
Solar System? Only astronomers with the largest radio telescopes can see 
the full picture — and definitive observations are beginning to filter through. 


Cosmic jets — enormously energetic, highly 
directed beams of charged particles — can be 
ejected by stars as they form', as they die™*, 
and even, in certain instances, as they are 
reborn, rekindled by the accretion of sur- 
rounding gas”*. The energy carried away by 
a jet can help to bring a dying massive star to 
explosion in a supernova, or disperse a less 
massive one as a planetary nebula. Jets provide 
some of the heating of the gas between stars, 
and can even disrupt the intergalactic medium 
in galaxy clusters. 

Despite a profusion of hypotheses for how 
these cosmic power generators work, astrono- 
mers still don’t have a clinching theory, largely 
because of a dearth of observations with 
which to compare models. New findings from 
Marscher et al. (page 966 of this issue)’ go a 
long way to redressing this imbalance. The 
authors report how, in unprecedented detail, 
they imaged a jet engine belching out a puff of 
exhaust, allowing real-time insight into how 
jets work. 

Theories exploring the role of gas accretion 


in jet acceleration originated in the late 1970s 
and early 1980s*"°, and have more recently 
been extended to take account of features such 
as flows at close to the speed of light and the 
very hot temperatures of the jet plasma’. Many 
of the jet-forming objects are dense, accreting 
agglomerations of matter such as neutron 
stars or black holes, both of which originate 
from failed stars. But jets are also produced by 
objects such as fading ‘white dwarf’ stars, or 
even stars such as the Sun when they are first 
born. Understanding jet emission from one 
type of these objects, therefore, might lead to 
an understanding of many others. 

The favoured picture that has emerged is of a 
mechanism similar to the operation of a fighter 
plane's jet engine, but confined within the lines 
of a strong magnetic field rather than a steel 
casing. Rotation of the magnetosphere around 
the central object creates a stiff, helical field 
that acts as both a turbine and a nozzle, expel- 
ling the exhaust flow and collimating it into a 
narrow beam (see Fig. 3 on page 968). Far from 
the object, this natural pinching action can 
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overcollimate the beam, so that it starts to con- 
verge. The point of maximum convergence is 
known as the modified magnetohydrodynamic 
fast point; beyond that point, the jet can pro- 
duce a strong ‘fast-mode’ shock wave that can 
distort or destroy the well-ordered structure 
of the magnetic nozzle. Other, less dramatic, 
‘slow-mode’ shocks — pressure waves that 
travel parallel to the twisting field lines and 
do not disrupt them — can develop before the 
flow reaches the modified fast point. 

The equations governing these processes 
are very complex, and several simplifying 
assumptions are needed to solve them. For that 
reason, the whole picture of magnetic jet accel- 
eration just sketched is still a matter of debate. 
Marscher et al.’ are the first to test the picture 
observationally. Their results, at least partially, 
confirm the model. 

The authors’ test-bed was the notoriously 
unruly BL Lacertae, the archetype of a par- 
ticularly extreme form of jet emitter known as 
a blazar. BL Lacertae is an active galaxy con- 
taining a central supermassive black hole, and 
produces a jet with a speed 99% that of light. 
The jet, to within 10°, is pointing directly at us, 
so that looking down into the jet nozzle allows 
Earth-bound observers to examine the engine’s 
inner workings directly. 

This particular jet emits enormous amounts 
of energy at radio frequencies, and is huge 
— about two light years across. So, despite its 
immense distance from us (about 900 million 
light years), its angular size seen from Earth 
is fairly large, at around 0.5 milliarcseconds 
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(one arcsecond is 1/3,600th of a degree). 
Regular stellar jet engines in our Galaxy might 
be much closer, but they are diminutive by 
comparison, just nanoarcseconds in size. 
The jet from BL Lacertae was big enough for 
Marscher et al. to exploit a technique known as 
very-long-baseline interferometry (VLBI), 
which uses radio telescopes scattered around 
Earth's surface to create essentially one big 
radio telescope. In this case, these were the 
ten telescopes of the National Radio Astron- 
omy Observatory’s Very Long Baseline Array 
(VLBA), situated across the United States. 
Using this technique, the authors could make 
images of the jet with an angular resolution of 
about 0.1 milliarcseconds. 

The VLBI technique, coupled with y-ray, 
X-ray and optical-wavelength observations, 
allowed the authors to observe what happened 
as BL Lacertae sent a new pulse of plasma 
down its jet nozzle (see Fig. 1 on page 966). 
During the period of observation, the source 
twice sent out a flare of very-high-energy y- 
rays. The first event looked very much like a 
slow-mode shock wave, tracing the plasma 
pulse as it travelled around one coil of the heli- 
cal magnetic field. The field would thus seem 
to be well ordered up to that point, as predicted 
by theoretical models. 

The second y-ray flare coincided with a 
strong brightening of the radio ‘core’; by this 
time, the pulse had probably left the magnetic 
nozzle and was passing through the strong 
fast-mode shock. Unfortunately, measurement 
limitations mean that we cannot be certain 
whether the magnetic field was helical or dis- 
ordered in this core region during the second 
flare, although the strengthening and weaken- 
ing of the radio polarization during and after 
the flare points to the second option. In that 
case, the fast-mode shock seems to have at least 
partially disrupted the helical field structure. 
Thus, despite the relative simplicity of the theo- 
retical models, many of their predictions seem 
borne out by observations. This confirmation 
of the theoretical understanding of systems 
containing supermassive black holes increases 
the likelihood that jets of all sizes and types 
work in a similar way. 

At present, jet acceleration from any cosmic 
object can be imaged only using the technique 
of VLBI at radio wavelengths. Observations 
at y-ray wavelengths alone create the impres- 
sion that flares may be produced within one- 
thousandth of a light year of a central black 
hole, but these latest observations’ indicate 
that they occur more than a light year away. 
Thus, when NASAs y-ray Large Area Space 
Telescope GLAST, which is due to launch on 16 
May, starts to deliver jet measurements, it will 
need VLBI co-observations to deliver a com- 
plete picture. Japan’s new VSOP-2 satellite will 
increase the radio imaging resolution of jets 
such as that of BL Lacertae by at least a factor 
of three, but to complete much of its proposed 
work it will need instruments such as the VLBA 
to supply detailed interferometry images’. 
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Unfortunately, VLBI instruments are currently 
facing a funding gap that could see them scaled 
back or closed. The results of Marscher et al.’ 
underscore how vital these instruments are to 
the understanding of jet acceleration through- 
out the Universe. | 
David L. Meier is at the NASA Jet Propulsion 
Laboratory, California Institute of Technology, 
4800 Oak Grove Drive, Pasadena, 

California 91109, USA. 

e-mail: dlm@sgra.jpl.nasa.gov 
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GENE TRANSCRIPTION 


Two worlds merged 


David M. Lonard and Bert W. O'Malley 


Why would two distant genes — on separate chromosomes and from 
different nuclear locations — unite in response to signals for gene 
expression? They might be seeds for the formation of transcriptional hubs. 


Gene transcription occurs largely at the sub- 
microscopic scale. So although microscopic 
analysis of nuclear architecture has implicated 
various structures in this process’, it has lacked 
the power to unravel the role that higher-order 
organization of chromatin (complexes of DNA 
and histone proteins) has in the expression of 
individual genes. Consequently, it has been 
difficult to combine whole-cell approaches 
such as microscopy with the molecular and 
biochemical techniques’ that are primarily 
used to study gene expression. Fortunately, 
technological advances in both microscopy 
and molecular approaches are closing this 
gap. For instance, writing in Cell, Nunez et al? 
bring a refreshing concordance between these 
two types of method to show that gene expres- 
sion associated with activation of the nuclear 
receptor ERa (oestrogen receptor-a) depends 
in part on a large-scale reorganization of the 
genome that involves interactions both within 
and between chromosomes. 

Much of the recent progress in understand- 
ing how gene transcription factors interact 
with chromatin can be credited to ‘ChIP-on- 
chip’ technology’. For example, this method 
revealed that ERa interacts with a specific set of 
DNA sequences known as oestrogen-response 
elements’ (EREs), and that only some of these 
interactions enhance transcription. Other 
transcription-factor binding sites adjacent to 
EREs also seem to function cooperatively in 
facilitating transcription, indicating that vari- 
ous non-coding DNA sequences influence 
gene expression’. 

For their analysis of chromatin reorgani- 
zation in response to the activation of ERa 
transcription, Nunez et al. used many sophis- 
ticated approaches, one of which — chromatin- 
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conformation capture — is also related to 
ChIP-on-chip. This technique allows the 
detection of long-range interactions between 
genes, even those on different chromosomes’. 
Specifically, the authors aimed to characterize 
the interaction between two genes regulated 
by the oestrogen hormone: TFF1 on chromo- 
some 21 and GREB1 on chromosome 2. They 
find that, in response to hormone treatment, the 
chromosomal regions (loci) containing these 
genes physically reach out for each other. 

Nunez and colleagues also find that genes 
at other oestrogen-regulated loci interact with 
each other, indicating that interchromatin pair- 
ing of DNA sequences is a common feature of 
ERa-mediated transcription. What's more, pre- 
vious work using another approach — assess- 
ing the effect that restricting genes’ movement 
within the nucleus has on their transcription 
— has shown that other functionally related 
genes, such as those with roles in immunity or 
cellular differentiation, also make interchroma- 
tin contacts’. So it seems that interchromatin 
interactions are not restricted to a particular 
cellular process. 

Interchromatin contact between TFFI and 
GREB1 depends on the binding of ERa to an 
ERE. Moreover, these interactions are also 
influenced by the activity of other proteins, 
including chromatin remodelling factors and 
proteins that bind to the cytoskeletal protein 
actin; this indicates that passive movement is 
not sufficient for chromosomal reorganiza- 
tion’. Nunez et al.’ add to the list of proteins 
known to mediate interchromatin interactions, 
showing that transcriptional coactivators such 
as SRC-3/AIB1, CBP, p300 and PBP are also 
involved in this process. Furthermore, they 
show that another co-regulatory protein — the 
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Figure 1| Rambling genes. a, Nunez et al.’ find that, in the absence of oestrogen, two genes (TFF1 and GREB1) that are activated by this hormone and 

found on different chromosomes, reside in different locations within the nucleus. b, On exposure to oestrogen, these genes make interchromatin contacts, 
facilitated by the nuclear receptor ERa, transcriptional coactivators (CoAs), actin and molecular motor proteins. c, Subsequently, the gene pair interacts with 
a nuclear speckle in an LSD1-dependent manner, creating a multiprotein complex that might act as a transcriptional hub for DNA transcription into mRNA. 


ERa-associated demethylase enzyme LSD1, 
which does not influence interchromosomal 
gene interaction — is required for the sub- 
sequent colocalization of the gene pairs with 
nuclear speckles (structures composed of high 
concentrations of specific proteins involved in 
messenger-RNA splicing), thereby probably 
leading to mRNA maturation (Fig. 1). 
Transcriptional coactivators are multipro- 
tein complexes that, collectively, have diverse 
enzymatic activities’. So it is tempting to specu- 


late that, together with LSD1, distinct enzymes 
of a coactivator complex mediate actin poly- 
merization (which might facilitate gene 
movement within the nucleus), interchroma- 
tin interactions and chromatin interface with 
transcription ‘factories’ — structures that 
generate mRNAsat higher rates. Ultimately, it 
is likely that coactivators and other recep- 
tor-associated proteins work together to 
orchestrate the formation of an ERa-activated 
transcriptional supercomplex comprising 


PLANT BIOLOGY 


Scent of arose 


What Chinese roses lack in 
resistance to cold and disease, 
they make up for with repeated 
blooming and a distinctive 
fragrance. It isn't surprising, 
then, that plant breeders, 
fascinated with this ancient 

rose family, brought it to Europe 
in the nineteenth century to 
generate the now popular hybrid 
tea roses, among many others. 
But how did the distinctive scent 
of Chinese roses evolve? Gabriel 
Scalliet and colleagues now 
provide the answer (G. Scalliet 
et al. Proc. Natl Acad. Sci. USA 
105, 5927-5932; 2008). 

Unlike their European 
counterparts, the main scent 
component of Chinese roses 
and their descendants is 
dimethoxytoluene. In the final 
steps of its formation, two 
methyl groups are sequentially 
added to a precursor molecule 
in reactions catalysed by 
enzymes called OOMT1 and 
OOMT2. These enzymes are 
almost identical in amino-acid 


sequence, yet they target 
different substrates. 

Scalliet and colleagues 
pinpoint the crucial amino-acid 
residues of the enzymes that 
confer substrate specificity: 

a tyrosine in OOMT1anda 
phenylalanine in OOMT2. 
Swapping these single residues 
between the two enzymes 
switched their substrate 
specificity, most probably 

by changing the steric and 
hydrophobic properties of their 
substrate-binding sites, where 
these residues reside. Moreover, 
OOMT2 could methylate the 
target of OOMT1in vitro, albeit 
with lower efficiency. 

Two nearly identical enzymes 
with nearly identical activities, 
that can both perform similar 
reactions, although together 
they are more efficient than 
alone — these are hallmarks 
of the products of duplicated 
genes, which arise when a gene 
doubles in number and one of 
the copies then undergoes a 


minor mutation. Indeed, Scalliet 
et al. show that, although all of 
the 13 European rose species 
they examined carry an OOMT- 
like gene, only the Chinese 
species, which are evolutionarily 
younger, carry two types of 
this gene. The authors’ further 
phylogenetic analysis strongly 
hints that, in Chinese species, 
the second OOMT gene arose 
through a duplication event. 
Roses are not unique in 
acquiring new functions through 
gene duplication; this is thought 
to be a fundamental mechanism 
for generating diversity between 
and within organisms. But it isn't 
always possible to trace such 
changes, which makes Scalliet 
and colleagues’ work of interest 
not only to plant biologists but 
also to evolutionary scientists. 
Sadaf Shadan 


©2008 Nature Publishing Group 


ERa itself, coactivators, mRNA-splicing fac- 
tors, transcription factories and two or more 
oestrogen-regulated genes (Fig. 1). 

The possibility of ERa-regulated genes con- 
verging into distinct hubs gives rise to some 
interesting questions. How common is the 
above process among other genes regulated by 
nuclear receptors? What signals define how 
one gene finds its appropriate partner gene? Do 
these transcriptional hubs contain functionally 
related genes, or is their organization simply 
spatial? Could it be that sequences close to a 
gene, or possibly specific chromatin modifi- 
cations at each ERa-regulated locus, provide 
proper pairing information? 

Many EREs are located in stretches of DNA 
between genes, far away from known coding 
sequences. Do these sites also engage with 
interchromatin loci pairs, and, if so, to what 
extent do they influence the transcription of 
the partner locus? Do large-scale structural 
rearrangements induced by oestrogen, or other 
signals that activate oestrogen receptors, affect 
genome-wide expression of genes that are not 
directly regulated by ERa? These are some of 
the intriguing questions that remain. But with 
the availability of sophisticated techniques 
such as those used by Nunez et al.’, there is 
hope that the mysterious relationship between 
higher-order nuclear architecture, chromatin 
organization and gene expression will soon be 
revealed. a 
David M. Lonard and Bert W. O'Malley are in the 
Department of Molecular and Cellular Biology, 
Baylor College of Medicine, One Baylor Plaza, 
Houston, Texas 77030, USA. 
e-mail: berto@bcm.tmc.edu 
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The genome of the model beetle and pest 
Tribolium castaneum 


Tribolium Genome Sequencing Consortium* 


Tribolium castaneum is a member of the most species-rich eukaryotic order, a powerful model organism for the study of 
generalized insect development, and an important pest of stored agricultural products. We describe its genome sequence 
here. This omnivorous beetle has evolved the ability to interact with a diverse chemical environment, as shown by large 
expansions in odorant and gustatory receptors, as well as P450 and other detoxification enzymes. Development in Tribolium 
is more representative of other insects than is Drosophila, a fact reflected in gene content and function. For example, 
Tribolium has retained more ancestral genes involved in cell-cell communication than Drosophila, some being expressed in 
the growth zone crucial for axial elongation in short-germ development. Systemic RNA interference in T. castaneum functions 
differently from that in Caenorhabditis elegans, but nevertheless offers similar power for the elucidation of gene function and 


identification of targets for selective insect control. 


By far the most evolutionarily successful metazoans', beetles 
(Coleoptera) can luminesce (fireflies), spit defensive liquids (bom- 
bardier beetles), visually and behaviourally mimic bees and wasps, or 
chemically mimic ants that detect intruders by their foreign odour. 
Many beetles (for example, boll weevil, corn rootworm, Colorado 
potato beetle and Asian longhorn beetle) are associated with billions 
of dollars of agricultural and natural resource losses. 

The red flour beetle, Tribolium castaneum, found wherever grains 
or other dried foods are stored, has a highly evolved kidney-like 
cryptonephridial organ to survive such extremely dry environments. 
It has demonstrated resistance to all classes of insecticides used 
against it. Like all beetles, Tribolium has elytra (wing covers) that 
coordinate precisely with folding wings, allowing flight while provid- 
ing protection. 

Tribolium facilitates genetic analysis with ease of culture, a short 
life cycle, high fecundity, and facility for genetic crosses (see ref. 2), 
allowing efficient genetic screens by means of chemical mutagens, 
radiation and binary transposon systems’. As in Caenorhabditis ele- 
gans, RNA interference (RNAi) is systemic in Tribolium, facilitating 
knockdown of specific gene products in any tissue, developmental 
stage or offspring of double-stranded (ds)RNA-injected females*”. 

Particularly favoured for developmental studies, Tribolium is 
much more representative of other insects than is Drosophila’. In 
contrast to Drosophila, Tribolium larvae display eyes in a fully formed 
head and three pairs of thoracic legs (Supplementary Fig. 1). In 
addition, Tribolium develops via short-germ embryogenesis where 
additional segments are sequentially added from a posterior growth 
zone (Supplementary Fig. 1). This proliferative mechanism of seg- 
mentation differs from the Drosophila model, but resembles that of 
vertebrates and basal arthropods such as millipedes’. 


Genome sequence and organization 

Approximately 1.52 million sequence reads (7.3 coverage) were 
generated from the highly inbred Georgia 2 (GA2) strain and 
assembled into contigs totalling 152 megabases (Mb) and scaffolds 
spanning ~ 160 Mb of genomic sequence (Supplementary Tables 1—4 
and Supplementary Information). Almost 90% of this sequence was 
mapped to the ten Tribolium linkage groups using a genetic map of 


~500 markers generated from the GA2 strain’. Excluding hetero- 
chromatic regions dense in highly repetitive sequences, the gnome 
is well represented and of high quality (see Supplementary Data for 
details). 

G+C content. Tribolium, like Apis, has a very (A+T)-rich genome 
(33% and 34% G+C, respectively), but Tribolium G+C domains 
lack the extremes of G+C content present in Apis mellifera (Fig. 1 
and Supplementary Fig. 3). Despite global G+C similarity to Apis, 
genes in Tribolium, as in Anopheles and Drosophila but not Apis, show 
a bias towards occurring in (G+C)-rich regions of the genome 
(Fig. 1). Whatever mechanism drives the accumulation of A+T 
nucleotides in Tribolium, it does not affect genes in the manner 
observed in the honeybee, where perhaps additional mechanisms 
are present. 

Repetitive DNA. Fully one-third of the Tribolium genome assembly 
consists of repetitive DNA, which is also (A+T)-rich. Compared to 
other insects, there is a paucity of microsatellites (1-6-base-pair (bp) 
motifs) in Tribolium?. However, Tribolium contains a relative excess 
of larger satellites, including several with repeat units longer than 
100 bp (2.5% of the Tribolium genome compared with 0.7% in 
Drosophila). Most (83%) of the microsatellites are found in inter- 
genic regions (63%) or introns (20%), but there is strong over- 
representation of non-frameshift-causing repeats (3- and 6-bp 
motifs) due to a dearth of dinucleotide repeats (see Supplementary 
Information). Of 981 randomly chosen microsatellites, 509 (55.2%) 
are polymorphic in a sample of 11 Tribolium populations from 
around the world’, providing an extensive collection of markers for 
population studies. Preliminary efforts to assess global population 
structure show a shallow but significant correlation between geo- 
graphic and genetic distance (Supplementary Fig. 4). This suggests 
that anthropogenic dispersal may maintain a modest level of gene 
flow across vast distances in this human commensal. 

Transposable elements. Transposable elements and other repetitive 
DNA accumulate in regions along each linkage group that resemble 
the pericentric blocks of heterochromatin visible in HpalI-banded 
chromosomes"’. These regions are probably composed largely of 
highly repetitive heterochromatic sequences, and represent most of 
the 44-Mb difference between the estimated genome size (0.2 pg or 


*Lists of participants and affiliations appear at the end of the paper. 
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204 Mb"') and the current assembly (160 Mb). Indeed, as much as 
17% of the Tribolium genome is composed of a 360-bp satellite” that 
constitutes only 0.3% of the assembled genome sequence. Several 
families of DNA transposons, as well as long terminal repeat (LTR) 
and non-LTR retrotransposons, constituting approximately 6% of 
the genome, were identified via encoded protein sequence similarity 
to previously identified elements using TEPipe or BLAST, and are 
listed in Supplementary Table 5. 

Telomeres. Tribolium has a telomerase and telomeres containing 
TCAGG repeats", a variant of the standard arthropod TTAGG telo- 
meric repeat. Manual assembly of the proximal regions of multiple 
telomeres beyond the ends of the assembled scaffolds (Supplemen- 
tary Information) reveals TCAGG repeats interrupted by full-length 
and 5’-truncated non-LTR retrotransposons belonging to the R1 
clade, best known for insertions in the rDNA locus’. Tribolium 
telomeres range in length from 15 kilobases (kb) upwards and prob- 
ably represent a stage intermediate to the loss of telomeres and telo- 
merase in Diptera compared with the simple canonical structure of 
the honeybee”? or the more regular insertion of non-LTR retrotrans- 
posons into the simple repeats of the silkmoth”®. 


Gene content and the proteome 

Comparative gene content analysis. To understand the consensus 
set of 16,404 gene models in the context of other available insect and 
vertebrate genomes, all genes were classified according to their degree 
of similarity using systematic cross-species analysis. Five insects 
(Drosophila melanogaster, Anopheles gambiae, Aedes aegypti, T. cas- 
taneum, A. mellifera) and five vertebrates (Homo sapiens, Mus 
musculus, Monodelphis domestica, Gallus gallus, Tetraodon nigrovir- 
idis) with similar phylogenetic branching orders were chosen for 
the comparison. We found the fractions of universal and insect- 
specific orthologues in Tribolium similar to other insect genomes, 
as expected, whereas the number of genes without similarity is 
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Figure 1| Cumulative distribution of genic and genomic G+ C-content 
domains in Apis mellifera, Anopheles gambiae, Drosophila melanogaster and 
Tribolium castaneum. Cumulative distributions show the fraction of genes 
(thick lines) or of the entire genome (thin lines) occurring in G+ C-content 
domains less than a given percentage G+ C (<X). The more (A+T)-rich half 
of the T. castaneum genome contains only 30.8% of all T. castaneum genes 
(31.4% and 33% of A. gambiae and D. melanogaster genes, respectively), 
whereas the more (A+T)-rich half of the A. mellifera genome contains 
77.6% of its genes. At every point on the T. castaneum, A. gambiae and 

D. melanogaster curves there are fewer genes present in the fraction of 

the genome less than a given percentage G+ C than would be expected if the 
genes were randomly distributed. In contrast, A. mellifera exhibits the 
opposite distribution. 
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considerably higher (Fig. 2), possibly attributable to less stringent 
gene prediction. 

Over 47% of Tribolium genes (7,579) are ancient, with traceable 
orthologous relations between insects and vertebrates including 15% 
(2,403) universal single-copy orthologues. Another 1,462 Tribolium 
genes (9%) constitute the core of what are currently insect-specific 
orthologues. In comparison, 21% (4,937) of human genes have 
vertebrate-specific orthologues. 

Several hundred ancient genes seem to be under limited evolution- 
ary selection and were independently lost in several species studied 
(the patchy fraction, defined in Fig. 2). Each new genome uncovers 
previously invisible ancestral relations among genes—for example, as 
many as 126 orthologous gene groups shared between Tribolium and 
humans seem to be absent from the other sequenced insect genomes 
(Fig. 3 and Supplementary Table 10), 44 of which are single-copy 
genes present in all vertebrates. 

The evolutionary emergence of many predicted Tribolium genes is 
not clear. Thousands of genes currently appear to be species-specific 
as either no sequence similarity to other genes is detectable, or homo- 
logy but not orthology can be determined. Reassuringly, this fraction 
is similar in Tribolium and Drosophila. 

We quantified the species phylogeny using a maximum likeli- 
hood approach with the concatenated multiple alignment of 1,150 
universal single-copy orthologues present in all the organisms 
studied—an ideal genome-wide data set of essential genes evolving 
under similar constraints (Fig. 2 and Supplementary Fig. 6). This 
analysis confirmed previous analyses based on expressed sequence 
tag (EST) sequences that the Hymenoptera are basal within the 
Holometabola’’. The shorter branch length for Tribolium implies 
that the elevated rate of evolution observed in Drosophila and 
Anopheles occurred more recently'®. 
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Figure 2 | Insect gene orthology. Comparison of the gene repertoire in five 
insect and five vertebrate genomes, ranging from the core of metazoan genes 
(dark blue fraction on the left) to the species-unique sequences (white band 
on the right). The striped boxes correspond to insect- and vertebrate-specific 
orthologous genes, where the darker bands correspond to all insects or 
vertebrates (allowing one loss). N:N:N indicates orthologues present in 
multiple copies in all species (allowing one loss); patchy indicates ancient 
orthologues (requiring at least one insect and one vertebrate gene) that have 
become differentially extinct in some lineages. The species tree on the left 
(shown in detail in Supplementary Fig. 6) was computed using the 
maximum-likelihood approach on concatenated sequences of 1,150 
universal single-copy orthologues. It shows an accelerated rate of evolution 
in insects and confirms the basal position of the Hymenoptera within the 
Holometabola’’. Aaeg, Aedes aegypti; Agam, Anopheles gambiae; Amel, Apis 
mellifera; Dmel, Drosophila melanogaster; Ggal, Gallus gallus; Hsap, Homo 
sapiens; Mdom, Monodelphis domestica; Mmus, Mus musculus; Tcas, 
Tribolium castaneum; Tnig, Tetraodon nigroviridis. 
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Gene family expansion, frequently associated with a particular 
adaptation pressure, might reveal physiologically and phenotypically 
unique features of beetles (Supplementary Table 9 and protein family 
discussions below). Many duplications shaped the gene content of 
Tribolium, most notably among odorant-binding proteins and the 
CYP450 subfamilies CYP6 and CYP9 (Supplementary Fig. 11), some 
of which are involved in the development of insecticide resistance in 
the Diptera’’. Duplication of genes under copy-number selection in 
other species is indicative of species-specific neo-functionalization”. 
At least 152 genes duplicated in Tribolium have single-copy status in 
all other insects studied, including sevenfold duplication of genes 
orthologous to Drosophila CG1625, encoding a putative structural 
constituent of cytoskeleton, and human ENSP00000269392, encod- 
ing centrosomal pre-acrosome localization protein 1. 

We also analysed the phylogenetic distribution of orthologous 
gene group members to quantify evolutionary gene losses”. 
Although least affected, dozens of single-copy orthologues seem to 
be lost in each lineage. Thirty-eight such genes lost in Tribolium 
include rather unique genes, encoding phosphotriesterase-related 
protein and peroxisome assembly factor 1 (peroxin-2), compared 
to 59 such genes lost in Drosophila. Notably, for the less restricted 
fractions of orthologues (defined in Fig. 2), several hundred gene 
orthologues have been lost in each species. 


Analysis of specific gene sets 

In addition to a global automated analysis of the predicted Tribolium 
gene set, the consortium manually annotated and analysed ~2,000 
genes (some additionally subjected to RNAi and expression analysis), 
focusing on developmental processes and genes of importance for 
agriculture and pest management. 


Development 

We identified and analysed homologues of known insect and 
vertebrate developmental genes to gain novel insights into the 
molecular basis of developmental differences between Drosophila 
and Tribolium. Supplementary Table 11 lists selected Tribolium 
developmental genes and their Drosophila and Apis orthologues. 
Oogenesis. Despite profound differences in ovarian architecture— 
telotrophic versus polytrophic—we identified Tribolium orthologues 


Diptera 


(Dmel only) 


Figure 3 | Orthologous genes shared between insect and human genomes. 
The Venn diagram shows the number of orthologous groups of genes shared 
between the insect and human genomes. In addition to the majority of 
Urbilateria (last common ancestor of the Bilateria) genes shared by all the 
organisms, there are hundreds of genes that have been lost in some lineages 
(for example, only retained between human and Tribolium or human and 
honeybee, but lost in Diptera). Diptera is represented here by Anopheles 
gambiae, Aedes aegypti and Drosophila melanogaster (with numbers 
considering only D. melanogaster shown in parentheses). 
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of most Drosophila genes required for stem cell maintenance, RNA 
localization and axis formation. Like Apis, however, Tribolium lacks a 
bag of marbles orthologue, which is essential for the differentiation of 
cystoblast versus germline stem cells in Drosophila’’. Interestingly, 
an orthologue of the gene gld-1, which fulfils a similar function in 
C. elegans”, is present in Tribolium. 

Anterior—posterior patterning. Analysis of the genome sequence 
confirmed the absence of a bed orthologue in Tribolium. Instead, 
anterior patterning is synergistically organized by otd and hb 
(ref. 23). However, it is still unclear how the posterior gradient of 
Tribolium Caudal is shaped in the absence of Bicoid. Notably, 
Tribolium contains an orthologue of mex-3, a factor that translation- 
ally represses the C. elegans cad homologue. Although the Tribolium 
genome contains orthologues of the Drosophila segmentation genes, 
their functions are not entirely conserved***’. Furthermore, the 
genome reveals the unexpected polycistronic organization of a novel 
gap gene, mille-pattes”*, the transcript of which encodes several short 
peptides. 

In contrast to the classical protostomian model organisms 

Drosophila and Caenorhabditis, Hox genes in Tribolium map to a 
single cluster of ~750kb on linkage group 2. Orthologues of all 
Drosophila Hox genes and the Hox-derived genes ftz and zen are 
transcribed from the same strand, and we find no evidence for inter- 
spersion of other protein-coding genes. Taken together, these results 
suggest that the evolutionary constraints preserving Hox cluster 
integrity still function in Tribolium. 
Dorso-ventral patterning. As in Drosophila, the dorso-ventral axis of 
the Tribolium embryo depends on a nuclear gradient of Dl, an NF-KB 
protein, which is established through ventral activation of a Tl recep- 
tor” (one of four in Tribolium). Factors required for localized Tl 
activation are also present in Tribolium (potential Tl ligands: six 
spz-like genes; extracellular proteases: one gd, six snk and four tan- 
dem ea genes), suggesting that, as in Drosophila, an extra-embryonic 
signal induces the embryonic dorso-ventral axis. 

Tribolium sog inhibition of Dpp/BMP generates a patterning gra- 
dient along the dorso-ventral axis*®. Similar chordin/sog function in 
spiders and a hemichordate suggest that this may represent the ances- 
tral bilaterian condition’’. Like Apis, Tribolium lacks an orthologue of 
Drosophila scw, but knockdown of another ligand, Tribolium gbbl, 
affected the embryonic Dpp/BMP gradient. Tribolium contains 
orthologues of all five Drosophila TGF-B receptors; however, Dpp 
signalling moderators that have duplicated and diverged in 
Drosophila, such as Tol/tok and Cv/tsg, occur as a single copy in 
Tribolium. Most strikingly, Tribolium contains homologues of 
BMP10 as well as bambi, Dan and gremlin BMP inhibitors, which 
are all known from vertebrates, but are not found in Drosophila. 
The growth zone. We identified several members of the Fgf and Wnt 
signalling pathways. The expression patterns of Tribolium Fef8, 
Wntl, Wnt5 and WntD/8 (refs 31, 32) highlight the dynamic organ- 
ization of the growth zone and underline its role in axis elongation. 
Head patterning. Orthologues of 25 out of 30 key regulators of the 
vertebrate anterior neural plate are specifically expressed in the 
Tribolium embryonic head (Supplementary Table 12). Two ortholo- 
gues are not expressed in the head neuroectoderm (barH, arx) and 
three do not have Tribolium or Drosophila orthologues (vax, hesx1, 
atx). Of the canonical Drosophila head gap genes, only the late head- 
patterning function of otd is conserved. ems function is restricted to 
parts of the antennal and ocular segments, and knockdown of btd 
seems to have no phenotypic consequences. Thus, analysis of 
Tribolium genes defines a set of genes that is highly conserved in 
bilaterian head development, and underscores the derived mode of 
Drosophila head patterning. 

Leg and wing development. In contrast to Drosophila, ventral 
appendages in Tribolium develop during embryogenesis from buds 
that grow continuously along the proximo-distal axis*’. Nonetheless, 
we identified Tribolium orthologues for most of a core set of 
Drosophila appendage genes (Supplementary Table 13). On the other 
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hand, orthologues of genes not found in Drosophila, such as Wnt11, 
gremlin, Fgf8 and an F-Box gene, are expressed in the embryonic 
legs*’*’. Although their exact function in Tribolium appendages is 
not known, Fe¢f8 is essential to vertebrate limb development. 

A major innovation driving the radiation of beetles was the evolu- 

tion of a highly modified protective forewing. Expression analysis 
and RNAi experiments revealed a high degree of conservation 
between Tribolium and Drosophila wing gene networks (Supplemen- 
tary Table 13), supporting the hypothesis that sclerotized elytra 
evolved from ancestral membranous wings mainly through new 
interactions between conserved patterning modules and as yet 
unknown downstream effector genes. 
Eye development. Tribolium has orthologues of nearly all genes 
currently known to regulate specification and differentiation in the 
Drosophila retina (Supplementary Table 14). Exceptions are the 
linker protein Phyllopod and the lens crystallin protein 
Drosocrystallin, which are restricted to Diptera. Eight of fifty-seven 
investigated eye developmental genes are duplicated in the 
Drosophila genome but not Tribolium, and in four cases the 
Drosophila paralogues have similar function. This suggests a more 
dynamic evolution of Drosophila retina genes and higher genetic 
complexity, highlighting the value of Tribolium as a more ancestral 
and simply organized model of insect eye development. 


Genes relevant to pest and Tribolium biology 


Tribolium castaneum is a notorious invader of stored grains and grain 
products. Resultantly, much effort and expense is directed to find 
better ways to control this and other grain pests. Here we describe 
established and possible future pesticide targets, as well as genes 
underlying vision and taste. Finally, we describe genes forming the 
basis of systemic RNAi in Tribolium. 


Established insecticide targets 

Cys-loop ligand-gated ion channels. Members of this superfamily 
mediate chemical synaptic transmission in insects and are targets of 
successful pest control chemicals with animal health and crop pro- 
tection applications*. The Tribolium Cys-loop ligand-gated ion 
channel (Cys-loop LGIC) superfamily contains 24 genes, the largest 
known so far for insects (Drosophila and Apis superfamilies comprise 
23 and 21 genes, respectively), due in part to the additional nicotinic 
acetylcholine receptor (nAChR) subunits in Tribolium. We also 
found genes for ion channels gated by y-aminobutyric acid (y-ami- 
nobutyric acid receptors (GABARs)), glutamate (GluCls) and his- 
tamine, as well as orthologues of the Drosophila pH-sensitive chloride 
channel’. The molecular diversity of the Tribolium Cys-loop LGIC 
superfamily is broadened by alternative splicing and RNA A-to-I 
editing, which in some cases generates species-specific receptor iso- 
forms”. The Tribolium Cys-loop LGIC superfamily is the first com- 
plete set of genes encoding molecular targets of several insecticides— 
imidacloprid and other neonicotinoids (nAChRs), fipronil 
(GABARs) and avermectins (GluCls)—described for an agricultural 
pest species. 

Cytochrome P450 proteins. Most insect cytochrome P450 proteins 
(CYPs) are thought to be involved in metabolic detoxification of host 
plant allelochemicals and toxicants, and several are insecticide resis- 
tance genes**. Other CYPs act in the synthesis and degradation of 
lipid signalling molecules, such as ecdysteroids*’. Similarly to mos- 
quitoes, especially Aedes, Tribolium has an independently expanded 
CYP gene family, particularly those involved in environmental res- 
ponse (Supplementary Table 16). 

Within the Tribolium P450s, the CYP2 and mitochondrial clans 
have undergone relatively little gene expansion, lack pseudogenes, 
and are probably reserved for essential endogenous functions in 
ecdysteroid metabolism and development. In contrast, expansions 
via tandem duplication produced 85% of Tribolium P450s clustered 
in groups of 2-16 genes, with large expansions of CYP3 and CYP4 
clans involved in environmental response. In comparison, Apis has 
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only four CYP4 genes, whereas Aedes has relatively similarly sized 
expansions of CYP3 and CYP4 clans (Supplementary Table 16). We 
speculate that both mosquito larvae (which are omnivorous scaven- 
gers) and Tribolium have adapted to diverse chemical environments 
in part by expansion of CYP gene families involved in detoxification. 


Possible future insect control targets 

Cl cysteine peptidase genes. Tribolium castaneum has successfully 
exploited cereal grains in spite of the arsenal of defensive allelochem- 
icals, including inhibitors of serine peptidase digestive enzymes. In 
tenebrionid beetles, cathepsins B, L and serine peptidases such as 
trypsins and chymotrypsins are part of the digestive peptidase com- 
plex in the larval gut®®. 

Comparing potential digestive peptidase genes in Tribolium with 
those in other sequenced insects (Supplementary Fig. 12) we found 
more Cl cysteine peptidase genes in T. castaneum. The proliferation 
of Tribolium C1 cysteine peptidase genes reflects expansions into five 
gene families, corresponding to four major clusters. This expansion is 
consistent with a trend seen in some beetles relative to other insects: a 
shift to a more acidic gut, conducive to cysteine peptidase activity. 

Tribolium castaneum Cl cysteine peptidase genes encode B and L 

cathepsins, and include the first-known insect genes similar to O 
and K cathepsins (Supplementary Table 17). Most of the cathe- 
psin-B-like peptidases lack conserved residues in functional regions 
and thus may lack peptidase activity, whereas all but two Tribolium 
cathepsin L peptidase genes encode potentially functional enzymes. 
In vertebrates, O and K cathepsins are lysosomal cysteine peptidases, 
involved in bone remodelling and resorption. Analysis of Tribolium 
cathepsins may provide insight into this family of proteins whose 
elevated expression is associated with a significant fraction of human 
breast cancers and tumour invasiveness. 
Neurohormones and G-protein-coupled receptors. Insect neuro- 
hormones (neuropeptides, protein hormones and biogenic amines) 
control development, reproduction, behaviour, feeding and many 
other physiological processes, often by signalling through 
G-protein-coupled receptors (GPCRs). We found 20 genes encoding 
biogenic amine GPCRs in Tribolium (compared to 21 in Drosophila 
and 19 in Apis) and 52 genes encoding neuropeptide or protein 
hormone GPCRs (49 in Drosophila, 37 in Apis). Moreover, we 
identified the likely ligands for 45 of these 72 Tribolium GPCRs. 
Furthermore, we annotated 39 neuropeptide and protein hormone 
genes. We found excellent agreement (95%) between the proposed 
ligands for the Tribolium neurohormone GPCRs and the inde- 
pendently annotated neuropeptide and protein hormone genes. 
Interestingly, the Tribolium genome contains a vasopressin-like 
neuropeptide (TC06626) and a vasopressin-like GPCR gene 
(TC16363; Supplementary Fig. 14), neither of which has been 
detected in any other sequenced insect*’. Vasopressin in mammals 
is the major neurohormone stimulating water reabsorption in the 
kidneys”. Its presence in Tribolium may help the beetle to survive in 
very dry habitats. 


Genes relevant to Tribolium biology 

Vision. Most of the 21 investigated genes that participate in the 
Drosophila photo-transduction network are conserved in Tribolium 
(Supplementary Table 14). Most notable is the lack of ninaG and 
inaC, which may be functionally replaced by closely related para- 
logues in Tribolium. 

Tribolium contains only two opsin genes, representing members of 
the long-wavelength and _ ultraviolet-sensitivity-facilitating opsin 
subgroups. In contrast, Drosophila contains seven, and there is evid- 
ence for minimally three in most other insects. The lack of a blue- 
light-sensitive opsin gene in Tribolium is consistent with the unusual 
expression of long-wavelength opsin in all photoreceptor cells in 
this species*'. The implied reduction in colour discrimination in 
Tribolium is probably a consequence of the widespread cryptic life- 
style of this species group. 
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Odorant and gustatory receptors. Odorant and gustatory receptors 
form the insect chemoreceptor superfamily. Tribolium has a major 
expansion of both odorant and gustatory receptors relative to 
Drosophila, Anopheles and Aedes mosquitoes, silkmoth and honeybee 
(Supplementary Table 19). We identified and annotated 265 appar- 
ently functional odorant receptors, 42 full-length pseudogenes and 
34 pseudogene fragments. Most of these T. castaneum odorant recep- 
tors are in seven species-specific subfamilies, including one contain- 
ing 150 genes, and most are in tandem gene arrays, created by gene 
duplication within the Tribolium lineage in the last 300 million 
years”. 

We annotated 220 apparently functional gustatory receptors and 

25 pseudogenes (gustatory receptor gene fragments were not 
assessed). The gustatory receptor families in fruitflies and mosqui- 
toes, but not honeybee, contain several genes that are alternatively 
spliced, with multiple alternative long first exons encoding at least the 
amino-terminal 50% of the gustatory receptor spliced into a set of 
short shared exons encoding the carboxy terminus***°. Most 
Tribolium gustatory receptors are encoded by single genes; however, 
T. castaneum Gr214 is a massive alternatively spliced locus with 30 
alternative long 5’ exons (six of which are pseudogenic) spliced into 
three shared 3’ exons encoding the C terminus. Three T. castaneum 
gustatory receptors are orthologues of highly conserved gustatory 
receptors in other insects***°, two of which form a heterodimeric 
carbon dioxide receptor*®. The remainder form many species-specific 
subfamilies, one of which is expanded to 88 genes (Supplementary 
Information and Supplementary Fig. 16). 
Systemic RNAi. In Tribolium, as in C. elegans but not Drosophila, the 
RNAi effect spreads systemically from the site of injection to other 
tissues’ and from injected females to their offspring*. Surprisingly, 
our survey of genes involved in systemic RNA did not reveal much 
conservation between Tribolium and C. elegans. 

The SID-1 multi-transmembrane protein, essential for double- 
stranded RNA (dsRNA) uptake in C. elegans, is not found in 
Drosophila, suggesting that the presence or absence of a sid-1 gene 
is the primary determinant of whether or not systemic RNAi occurs 
in an organism. We found three genes in Tribolium that encode 
proteins similar to SID-1. However, their sequences are more similar 
to another C. elegans protein, TAG-130 (also known as ZK721.1), 
which is not required for systemic RNAi in C. elegans*’. Additionally, 
the secondary argonaute proteins and RNA-dependent RNA poly- 
merase (RdRP)***’, essential for the amplification of the initial 
dsRNA trigger in C. elegans, are absent in Tribolium. Therefore, the 
molecular basis for systemic RNAi in Tribolium and other insects 
might differ from that in C. elegans and remains to be elucidated. 


Concluding remarks 


We observe three trends when comparing Tribolium and other insect 
genomes. First, phylogenetic trees show shorter branch lengths for 
Tribolium (and Apis) than Drosophila. The accelerated evolution of 
the Drosophila lineage in some cases rendered Drosophila atypical for 
the Insecta. Second, Tribolium retains a different set of ancestral 
genes that have evolved at a moderate rate (for example, gremlin 
and cathepsins), and these may provide insights into the function 
of their vertebrate orthologues. Third, its own evolutionary path 
has led to beetle- and perhaps Tribolium-specific gene changes (for 
example, a large increase in odorant receptors). 

Expansions of CYP proteins, proteinases, diuretic hormones, a 
vasopressin hormone and receptor, and chemoreceptors all indicate 
adaptation to a dry, chemically diverse and toxin-rich microenviron- 
ment. Whereas the flour beetle’s drought tolerance probably explains 
the presence of vasopressin, it is more difficult to rationalize a need 
for such an unprecedented diversity of chemoreceptors. Functions 
stemming from the diversity of angiosperm-derived chemicals such 
as distant detection of food sources and avoidance of toxic host plant 
defence chemicals suggest that this expansion may be common to the 
Coleoptera. The expansion of odorant receptors is more intriguing 
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when considered in combination with the reduction of opsin genes. 
Both trends may reflect the long-term consequences of adaptation to 
low light biota by Tribolium, enforcing selection for increased dis- 
crimination of odour reception but not colour perception”. 

Given the chemo-sensing and detoxifying genes described above, it 
is perhaps no surprise that Tribolium has demonstrated resistance to 
all insecticides used for its control. Given the association of Tribolium 
with human food, knowledge ofall possible insecticide targets will aid 
greater selectivity in pesticide design, thereby mitigating possible side 
effects. Finally, the true value of this sequence may be the entry it 
provides into the many and richly diverse facets of beetle biology, 
physiology and behaviour. 


METHODS SUMMARY 


Detailed Methods are described in the Supplementary Information. Resources 
generated by this project can be found at the following locations: genome 
assemblies, sequences, and automated and manually curated gene model 
sequences are available from the BCM-HGSC website and ftp site (http:// 
www.hgsc.bcm.tme.edu/projects/tribolium/). Browser display of the genome 
sequence, all gene predictions and available tiling array data are available via 
http://www.genboree.org and Beetle Base (http://www.bioinformatics.ksu.edu/ 
BeetleBase/), a long-term repository for Tribolium data. 
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Lateral presynaptic inhibition mediates 
gain control in an olfactory circuit 


Shawn R. Olsen! & Rachel |. Wilson! 


Olfactory signals are transduced by a large family of odorant receptor proteins, each of which corresponds to a unique 
glomerulus in the first olfactory relay of the brain. Crosstalk between glomeruli has been proposed to be important in 
olfactory processing, but it is not clear how these interactions shape the odour responses of second-order neurons. In the 
Drosophila antennal lobe (a region analogous to the vertebrate olfactory bulb), we selectively removed most interglomerular 
input to genetically identified second-order olfactory neurons. Here we show that this broadens the odour tuning of these 
neurons, implying that interglomerular inhibition dominates over interglomerular excitation. The strength of this inhibitory 
signal scales with total feedforward input to the entire antennal lobe, and has similar tuning in different glomeruli. A 
substantial portion of this interglomerular inhibition acts at a presynaptic locus, and our results imply that this is mediated by 
both ionotropic and metabotropic receptors on the same nerve terminal. 


A sensory stimulus generally triggers activity in multiple neural pro- 
cessing channels, each of which carries information about some fea- 
ture of that stimulus. The concept of a processing channel has a 
particularly clear anatomical basis in the first relay of the olfactory 
system, which is typically divided into glomerular compartments. 
Each glomerulus receives input from many first-order olfactory 
receptor neurons (ORNs), all of which express the same odorant 
receptor. Each second-order neuron receives direct ORN input from 
a single glomerulus, and thus all the first- and second-order neurons 
corresponding to a glomerulus constitute a discrete processing chan- 
nel. An odorant typically triggers activity in multiple glomeruli, and 
local interneurons that interconnect glomeruli provide a substrate 
for crosstalk between channels. 

The Drosophila antennal lobe is a favoured model for investigating 
olfactory processing because it contains only ~50 glomeruli’, each of 
which corresponds to an identified type of ORN and an identified 
type of postsynaptic projection neuron (PN)*°. Several recent stud- 
ies of the Drosophila antennal lobe have produced divergent views 
regarding the relative importance of interglomerular connections. 
One model proposes that PN odour responses are almost completely 
determined by feedforward excitation®’. This model ascribes little 
importance to crosstalk between glomerular processing channels. 
An alternative model proposes that interglomerular connections 
make an important contribution to shaping PN odour responses*’. 
However, this has not been demonstrated by showing a change in PN 
odour responses when lateral inputs to that PN are removed. (We use 
the word ‘lateral’ as a synonym for ‘interglomerular’.) 

In principle, several features of olfactory processing in the 
Drosophila antennal lobe could reflect either intra- or interglomeru- 
lar events. For example, most PNs are more broadly tuned to odours 
than their presynaptic ORNs*"°. This could reflect a purely intraglo- 
merular nonlinear process, such as short-term synaptic depression 
at ORN-PN connections. Alternatively, it could be due to the fact 
that lateral excitatory connections exist between glomeruli”'””. 
It is also unclear whether inhibitory epochs in PN odour responses 
reflect inter- or intraglomerular events. Many interneurons that are 
immuno-positive for GABA (‘y-aminobutyric acid) form connections 
between glomeruli”'*"*, but several recent studies have failed to 
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observe any interglomerular inhibition”''’*. These studies removed 
all the direct ORN inputs to an identified PN, and asked whether 
lateral input could be evoked in that PN by olfactory stimulation of 
other ORN types. In all cases, lateral inputs to PNs were excitatory. 
This raises the possibility that interglomerular inhibition might not 
exist, and that inhibitory PN odour responses might merely reflect 
intraglomerular feedback'*’’”. This is an important issue because 
intra- and interglomerular inhibition have different consequences 
for how olfactory representations are transformed in this circuit. 

In this study, we address three questions. Do interglomerular 
interactions make a substantial contribution to PN odour responses? 
Do these interactions include lateral inhibition? If so, how does this 
occur, and why has it been difficult to observe? 


Removing lateral input to projection neurons 


We began by investigating what happens to PN odour responses 
when most lateral input to that PN is removed. We took advantage 
of the fact that the fly has two olfactory organs. About 90% of ORNs 
are contained in the antennae, whereas 10% are in the maxillary palp 
(Fig. la). Palp ORNs express odorant receptors not expressed in the 
antennae, and project to palp glomeruli that are distinct from glom- 
eruli targeted by antennal ORNs**. Antennal and palp glomeruli are 
interconnected by local interneurons. Acute removal of the antennae 
eliminates 90% of the input to the antennal lobe, and therefore most 
excitatory drive to local interneurons (Fig. 1b). Iflateral connections 
are mainly excitatory”’’’, then removing the antennae should 
decrease the odour responses of palp PNs. 

We performed this experiment in two different palp glomeruli 
(VM7 and VC1). Surprisingly, removing the antennae increased most 
of the odour responses of these PNs (Fig. 1c, d and Supplementary Figs 
2. and 3). No odour responses were decreased. This implies that most of 
our odours normally evoke lateral inhibitory input to these glomeruli, 
and that this outweighs the effect of lateral excitatory input. 

We examined the input-output function of each glomerulus by 
plotting the strength of each PN odour response against the strength 
of the cognate ORN response to the same odour. These input-output 
functions were nonlinear (Fig. 1d), which is typical for most glom- 
eruli'°. When we removed most lateral input to VM7 and VC1 PNs, 
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the nonlinearity persisted (Fig. 1d) and these PNs became even more 
broadly tuned (Fig. le). This argues that broad tuning of PNs*’° 
results mainly from purely intraglomerular mechanisms. These could 
include short-term synaptic depression at ORN—PN connections 
and/or an intrinsic ceiling on PN firing rates. Lateral excitation should 
tend to broaden PN tuning even more, but lateral inhibition evidently 
counteracts this. 

One clue to the significance of lateral inputs is that, in the intact 
antennal lobe circuit, PN responses cannot be predicted purely on the 
basis of feedforward excitatory inputs'®. Two odours can elicit similar 
responses in an ORN but divergent responses in a postsynaptic PN. For 
example, pentyl acetate and 4-methyl phenol evoke similar activity in 
VM7 ORNs but not in VM7 PNs (Fig. 1c), implying that these odours 
recruit different lateral inputs to this glomerulus. After antennae were 
removed, these odours evoked similar responses in VM7 PNs (Fig. Ic). 
Overall, removing the antenna increased the correlation between the 
ranked odour preferences of ORNs and their cognate PNs (Spearman’s 
p increased from 0.82 to 0.88 for VM7 and from 0.89 to 0.94 for VC1; 
P< 0.02 for each comparison, Mann—Whitney U-test). 


Lateral inhibition suppresses ORN input 


Several recent studies have failed to observe lateral inhibition in the 
Drosophila antennal lobe”''"*. These studies silenced all direct ORN 
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Figure 1| Removing lateral input disinhibits projection neurons. 

a, Drosophila olfactory organs. b, Experimental configuration. c, Spiking 
responses of ORNs (green) and PNs (blue and magenta) for the palp 
glomerulus VM7 (n = 4-12 for each response). PNs are shown with 
(magenta) and without (blue) lateral input from antennal glomeruli. The 
black bars represent the 500-msec odour-stimulus period. d, Input—output 
functions for palp glomeruli VM7 (left) and VC1 (right). Each point 
represents the average PN response to an odour versus the response of the 
cognate ORNs. PNs are shown with antennae intact (magenta) or removed 
(blue). Responses to 18 out of 20 odours are significantly disinhibited in 
VM7; 13 out of 20 are significantly disinhibited in VC1 (P< 0.05, t-tests). 
e, Odour selectivity of ORNs (green) and cognate PNs with antennae intact 
(magenta) or removed (blue). Lifetime sparseness is 0 for an unselective cell 
and 1 for a maximally selective cell. All within-glomerulus comparisons are 
significant (P< 0.002, Mann—Whitney U-test), except ORNs versus 
antennae-intact PNs for VC1. Data are shown as mean + s.e.m. 
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inputs to a PN and focused on lateral input to that PN evoked by 
stimulating ORNs presynaptic to other glomeruli. We reasoned that 
some lateral inhibition might target ORN axon terminals; if so, this 
would only be observed when the direct ORN inputs to a PN are 
active. GABAergic inhibition at ORN axon terminals in the olfactory 
bulb has been described previously’**', and synapses from 
GABAergic interneurons onto ORN axon terminals have been found 
in an insect antennal lobe”. 

To test the idea that some lateral inhibition is presynaptic, we asked 
how lateral input to glomerulus VM7 depends on the activity of VM7 
ORNs. Each VM7 ORN fires spontaneously at ~10 spikess', and 
consequently these PNs are bombarded by spontaneous spike-driven 
excitatory postsynaptic potentials (EPSPs; Supplementary Fig. 4a). 
When we silenced VM7 ORNs by removing the maxillary palps, large 
spontaneous EPSPs disappeared in VM7 PNs (Fig. 2a). In this experi- 
mental configuration, stimulating the antennae with odorants depo- 
larized VM7 PNs (Fig. 2a), which is consistent with previous reports 
that lateral input is excitatory when direct ORN inputs are silent”''””. 
Next, we asked whether preserving spontaneous activity in VM7 
ORNs would allow us to observe lateral inhibition. To prevent 
odour-evoked activity in VM7 ORNs, we covered the maxillary palps 
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Figure 2 | Lateral inhibition suppresses spontaneous EPSCs and scales 
with total ORN input. a, Left: experimental configuration. Right: recording 
from a PN in glomerulus VM/7. Olfactory stimulation evokes depolarization. 
Spontaneous EPSPs are absent. b, Left: palps are shielded from odours. 
Right: recording from a PN in glomerulus VM7. Olfactory stimulation 
suppresses spontaneous EPSPs. c, Average VM7 PN responses as in a (green) 
or b (black); n = 6—7 PNs for each condition. When ORNs are shielded 
(black), inhibition dominates. When ORNs are absent (green), excitation 
dominates throughout the stimulus period. (Off-inhibition probably reflects 
lateral postsynaptic inhibition, see Supplementary Fig. 5.) Pale bands are 
s.e.m. d, Lateral input to VM7 (as in b) is correlated with the disinhibition in 
VM7 PNs after antennal removal (see Fig. 1b). Each point represents a 
different odour. Lateral input is measured as the time-integrated change in 
membrane potential. e, Lateral input to VM7 is correlated with total 
antennal ORN spiking activity evoked by each odour. f, Lateral input to VM7 
is correlated with the disinhibition in VC1 PNs after antennal removal. 
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with a plastic shield. We inferred that the shield did not prevent 
spontaneous activity in VM7 ORNs because we observed normal 
spontaneous EPSPs in VM7 PNs (Fig. 2b). The shield was clearly 
an effective barrier to odours because it blocked the normal 
strong excitatory response to ethyl butyrate in VM7 PNs 
(Supplementary Fig. 4). Instead, when the palps were shielded, 
stimulating the antennae with odorants suppressed spontaneous 
EPSPs in VM7 PNs (Fig. 2b, c). This was accompanied by hyper- 
polarization of the membrane potential, which would reflect (at least 
in part) the removal of ongoing depolarization produced by EPSP 
bombardment. 

For all 20 odours in our panel, lateral input depolarized VM7 PNs 
when their cognate ORNs were absent (as in Fig. 2a) and hyperpo- 
larized these PNs when their ORNs were spontaneously active (as in 
Fig. 2b). This argues that a substantial component of lateral inhibi- 
tion acts at a presynaptic locus. This does not mean that all lateral 
inhibition is presynaptic; indeed there is evidence for an additional 
postsynaptic component (Supplementary Fig. 5). 


Inhibition scales with total ORN input 

When palp ORNs were shielded, different odours evoked different 
amounts of inhibition in glomerulus VM7 (Fig. 2c, black traces). We 
hypothesized that this odour tuning could explain why antennal 
removal disinhibits some VM7 PN odour responses more than others 
(Fig. 1c, d). To test this, we measured the amount of inhibition 
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Figure 3 | Lateral GABAergic suppression of ORN-PN synapses. 

a, Experimental configuration. b, Electrical stimulation of the antennal 
nerve (arrowheads) evokes EPSCs in a PN (average of 20 trials). Olfactory 
stimulation (500 ms) inhibits EPSCs. Odour also evokes a transient inward 
current reflecting lateral postsynaptic excitation; this is resistant to GABA 
antagonists (Supplementary Fig. 6). ¢, Inhibition is blocked by the GABAg 
antagonist CGP54626 (CGP) together with the GABA, antagonist 
picrotoxin (control, n = 12; PCT, n = 5; CGP, n = 5; CGP + PCT, n= 5). All 
pairwise comparisons are significantly different except control versus PCT 
(P < 0.05, t-tests). d, Experimental configuration, substituting GABA 
iontophoresis for olfactory stimulation. e, As in b for GABA iontophoresis 
(asterisk). GABA also evokes an outward current. f, As in ¢ for GABA 
iontophoresis (control, n = 13; PCT, n = 5; CGP, n = 6; CGP + PCT,n= 7). 
All pairwise comparisons between conditions are significantly different 
except control versus PCT (P< 0.05, t-tests). Data are shown as 

mean = s.e.m. 
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evoked by each odour in the shielded-palps experiment, and com- 
pared this to the change in PN spiking responses to that odour after 
antennal removal. These two measures were well correlated (Fig. 2d, 
n= 20 odours, Pearson’s 7° = 0.65, P< 0.0001), which argues that 
these two experimental paradigms measure the same underlying 
phenomenon. 

The odour tuning of lateral input must reflect the connectivity of 
the local interneurons that mediate this inhibition. Many individual 
GABAergic interneurons innervate all glomeruli®’"’, suggesting that 
lateral inhibitory input to each glomerulus might reflect pooled 
input from all ORNs. If so, then the size of lateral input to a glom- 
erulus should correlate with the total ORN activity evoked by that 
odour. We estimated total ORN activity by summing the spiking 
responses of each antennal ORN type”, and found that this measure 
predicted the strength of lateral inhibition evoked by each odour in 
the shielded-palps experiment (Fig. 2e, n= 14 odours, Pearson’s 
r = 0.73, P< 0.0005). 

If lateral inhibition to all glomeruli scales with total ORN activity, 
then lateral inhibitory input to each glomerulus would show the same 
odour tuning. We have already shown that the odour tuning of lateral 
input to VM7 is a good predictor of which odour responses were 
most disinhibited in VM7 PNs after antennal removal (Fig. 2d). As 
expected, it also partially predicted which odour responses were most 
disinhibited in a different palp glomerulus, VC1 (Fig. 2f, n= 20 
odours, Pearson’s r° = 0.32, P< 0.01). However, this correlation 
was weaker than the correlation with disinhibition in VM/7. This 
leaves open the possibility that there may be some differences in 
the odour tuning of lateral inhibitory input to different glomeruli 
(see Discussion). 


GABA mediates presynaptic inhibition 


Our results suggest that much of the lateral inhibition in this circuit 
acts by suppressing ORN-PN synaptic transmission. To test this, we 
monitored ORN-PN synaptic strength in one glomerulus while 
recruiting lateral input to that glomerulus (Fig. 3a). We recorded 
from an identified PN while electrically stimulating the ipsilateral 
antennal nerve to evoke excitatory postsynaptic currents (EPSCs). 
Next, we used an odour to stimulate ORNs in the remaining intact 
antenna (and the maxillary palps). Because most glomeruli receive 
bilateral ORN input’, olfactory stimulation of the contralateral 
antenna drives activity in ipsilateral glomeruli. Finally, we prevented 
odours from recruiting direct ORN input to the recorded PN by 
mutating the odorant receptor gene normally expressed by its ORNs. 
As predicted, olfactory stimulation of the contralateral antenna 
inhibited EPSCs evoked by ipsilateral nerve stimulation (Fig. 3b, 
c). We could mimic this inhibition by iontophoresing GABA into 
the antennal lobe neuropil (Fig. 3d—f). A GABAg receptor antagonist 
blocked the late phase of this inhibition, but had only a modest effect 
on the early phase (Fig. 3c, f). Adding a GABA, antagonist to the 
GABAg antagonist blocked the residual early portion of the inhibi- 
tion (Fig. 3c, f). The GABA, antagonist alone had no effect (Fig. 3c, f). 
Taken together, these results suggest that both GABA, and GABA, 
receptors are present on the same ORN axon terminals, and that 
either GABA, or GABAg receptors alone are sufficient to mediate 
substantial inhibition of EPSCs just after GABA release. The late 
phase of inhibition evidently involves only GABAg receptors. 
Presynaptic inhibition is generally associated with a change in the 
way in which a synapse responds to paired electrical pulses**. We 
found that both the GABA, and the GABA, components of EPSC 
inhibition are associated with an increase in the paired-pulse ratio 
(Supplementary Fig. 7). This implies that the independent actions of 
both GABA, and GABAs receptors are at least partially presynaptic. 
As a further test of this model, we genetically abolished GABA, 
signalling selectively in presynaptic ORNs. We used an ORN-specific 
promoter” to drive expression of pertussis toxin**—a selective inhi- 
bitor of some types of G proteins. In these flies, GABA still inhibited 
ORN-PN EPSCs, but now this inhibition had a briefer duration than 
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Figure 4 | Genetic evidence that GABAg receptors inhibit ORN-PN 
synapses at a presynaptic locus. a, Experimental configuration. b, When 
pertussis toxin is specifically expressed in ORNs, GABAergic inhibition of 
EPSCs (solid line, 2 = 11) is more transient than in control flies (dotted line, 
n = 13, reproduced from Fig. 3f). ¢, Pertussis toxin expression in ORNs 
renders the GABAergic inhibition of EPSCs completely insensitive to CGP, 
and completely sensitive to PCT (black trace reproduced from a; CGP, n = 5; 
PCT, n = 6; CGP + PCT, n= 5). Compare to Fig. 3f. Asterisk represents 
GABA iontophoresis. Data are shown as mean = s.e.m. 


in wild-type flies (Fig. 4a, b). Unlike in wild-type flies, this inhibition 
was completely resistant to the GABA, antagonist and completely 
blocked by the GABA, antagonist (compare Fig. 4c to Fig. 3f). As a 
negative control, we confirmed that this phenotype requires both 
the ORN-specific promoter and the toxin transgene (Supplemen- 
tary Fig. 8). This demonstrates that GABAg receptors inhibit 
ORN-PN synapses at a purely presynaptic locus. 

If activation of either GABA, or GABAg receptors is sufficient to 
mediate strong lateral inhibition, then blockade of both receptors 
should be required to mimic the removal of lateral input. To test 
this, we again recorded from palp PNs in glomerulus VM7. 
Normally, the odour pentyl acetate weakly excites VM7 ORNs and 
inhibits VM7 PNs. When most lateral input is removed (by removing 
the antennae), this odour strongly excites these PNs (Fig. 1c). We 
could not mimic this disinhibition by applying either a GABA, or a 
GABAg receptor antagonist alone. However, the two antagonists 
together produced strong disinhibition that resembled the effect of 
removing the antennae (Fig. 5). 


Discussion 


Many previous studies have shown that odours can inhibit spiking in 
olfactory bulb mitral cells and antennal lobe PNs (see refs 27—32 for 
early examples), but in principle this inhibition could be purely 
intraglomerular!>'’. Experiments in vitro have revealed several types 
of interglomerular circuits**°, but some of these circuits are evi- 
dently not recruited by olfactory stimuli’. Here we have directly 
demonstrated an important role for inhibitory interactions between 
olfactory glomeruli in vivo. 

Our results argue that a substantial component of interglomerular 
inhibition occurs at a presynaptic locus. Previous studies in other 
species have shown that GABA can inhibit release from ORN axon 
terminals'**'. Our results imply that in Drosophila this is mediated by 
both GABA, and GABAg receptors. This arrangement is unusual but 
not unique; for example, there are several instances of GABA, and 
GABAsg inhibition at the same presynaptic site in other neural cir- 
cuits**’. Ionotropic and metabotropic receptors act with different 
kinetics, and so this arrangement might ensure that inhibition spans 
a broad time window. Although both receptor types were co-active 
during most of the odour response, we noticed that GABA, receptors 
were required for a brief early phase of inhibition after odour onset 
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Figure 5 | GABA receptor antagonists mimic removal of lateral input to a 
projection neuron. a, Rasters show spiking responses to pentyl acetate (grey) 
in a VM7 ORN and a VM7 PN. With antennae intact, both antagonists are 
required to mimic the effect of antennal removal on PNs. (Some off- 
inhibition in PNs persists after antennal removal or in the presence of 
antagonists; this probably reflects the off-inhibition in VM7 ORNs.) 

b, Average spike rates during odour stimulus period, minus baseline spike 
rates (n = 5-6 PNs for each condition). c, Average membrane potential 
responses to pentyl acetate in VM7 PNs (n = 5-6 projection neurons for 
each). Data are shown as mean + s.e.m. 


(Fig. 5) whereas GABAg receptors were required for the long, late 
phase (Figs 3 and 5). 

We have shown that lateral inhibitory input to a glomerulus 
roughly scales with total ORN activity. This would imply that the 
odour tuning of GABAergic input to each glomerulus is approxi- 
mately similar. However, the effects of lateral input may nevertheless 
be somewhat glomerulus-specific. Even if the odour tuning of GABA 
release were identical in all glomeruli, the efficacy of presynaptic 
inhibition will vary with presynaptic membrane potential*®*'. As a 
result, the same inhibitory signal should be more effective in some 
glomeruli than in others. Also, in some glomeruli, lateral inhibition 
might be outweighed by lateral excitation. This would explain why 
other studies have found that some PNs can be excited by odours that 
do not excite their cognate ORNs*'°!**?, 

We propose that this form of gain control represents a flexible 
balance between sensitivity and efficiency. When total ORN input 
is weak, lateral inhibition is minimal, and ORN-PN synapses are 
strong. When an odorant recruits vigorous ORN input to many 
glomeruli, GABAergic interneurons inhibit ORN neurotransmitter 
release. This should prevent a stimulus from saturating the dynamic 
range of many types of PNs simultaneously. Because this mechanism 
suppresses responses that are strong and redundant, it may tend to 
decrease cross-correlations between the output of different glom- 
eruli, and thus promote a more efficient neural code for odours. 


METHODS SUMMARY 


In vivo whole-cell patch-clamp recordings from PNs and extracellular recordings 
from ORNs were performed essentially as described previously*''*. Recordings 
in Fig. 3a-c were made from VM2 PNs in Or43b' mutant flies. Antagonists 
(50 pM CGP54626 and 5 uM picrotoxin, PCT) were added to the saline, which 
perfused the brain. Pertussis toxin was expressed under control of the Or83b- 
GAL4 driver. All data are shown as mean values, averaged across experiments, 
+s.em. (except for raw electrophysiological traces and rasters). The odour 
stimulus period (shown as a black bar in Figures) was 500 ms, except in Fig. 3b, c. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Fly stocks. Flies were reared at 20-21 °C on conventional cornmeal agar. All 
experiments were performed on adult female flies 2-5 days post-eclosion. Fly 
stocks were provided as follows: Or43b' (D. Smith); NP5103-GAL4, NP3481- 
GAL4 and NP5221-GAL4 (K. Ito and L. Luo); Or46a-GAL4 and Or83b-GAL4 
(L. Vosshall); UAS-DTI;; and UAS-DTIj; (L. Stevens); UAS-PTX (G. Roman); 
and UAS-CD8GFP;, (Bloomington Stock Center). 

ORN recordings. Extracellular recordings of ORN spiking were performed as 
described previously'®. To accurately sort spikes recorded from VC1 ORNs, we 
killed the second ORN type (VA71) housed in the same sensillum. This was done 
by expressing diphtheria toxin light chain in the VA7] ORNs (genotype: either 
Or46a-GAL4/UAS-DTl;, (n= 4) or Or46a-GAL4/+; UAS-DTljp/+ (n= 4)). 
The genotype for VM7 ORN recordings was NP3481-Gal4 UAS-CD8GEP, which 
is the same genotype as that used for the VM7 PN recordings. VM7 ORNs fire 
spontaneously at 10 spikes s_', and VC1 ORNs fire spontaneously at 3 spikes s_'. 
Projection neuron recordings. In vivo whole-cell recordings from PNs were 
performed as previously described’. For the current-clamp recordings in 
Figs 1, 2 and 5 and in Supplementary Figs 2—5, the composition of the internal 
patch-pipette solution was (in mM): potassium aspartate 140, HEPES 10, 
MgATP 4, Na3GTP 0.5, EGTA 1, KCI 1, and biocytin hydrazide 13 (pH 7.3, 
osmolarity adjusted to ~265mOsm). For the voltage-clamp recordings in 
Figs 3 and 4 and Supplementary Figs 6-8, the composition of the internal was 
(in mM): cesium aspartate 140, HEPES 10, MgATP 4, Na3;GTP 0.5, EGTA 1, KCl 
1, biocytin hydrazide 13, and QX-314 10 (pH7.3, osmolarity adjusted to 
~265 mOsm). The composition of the external saline solution in all recordings 
was (in mM): NaCl 103, KCI 3, N-tris(hydroxymethyl) methyl-2-aminoethane- 
sulfonic acid 5, trehalose 8, glucose 10, NuaHCO3 26, NaH2PO, 1, CaCl, 1.5, and 
MgCl, 4. Osmolarity was adjusted to 270-275 mOsm. The saline was bubbled 
with 95% O3/5% COs and reached a final pH of 7.3. Saline perfused the brain 
continuously at 2mlmin~'. Recordings were obtained with an A-M Systems 
Model 2400 amplifier (100 MQ headstage), were low-pass filtered at 5 kHz, and 
digitized at 10 kHz. Data were acquired in Igor Pro. In most experiments, we 
used an enhancer trap line to label specific PNs with green fluorescent protein 
(GFP) for targeted recording. VM7 PNs were labelled by NP3481-Gal4 UAS- 
CD8GFP (Figs 1, 2 and 5, and Supplementary Figs 2, 4 and 5). VC1 PNs were 
labelled by NP5221-Gal4 UAS-CD8GEP (Fig. 1 and Supplementary Fig. 3). VM2 
PNs were labelled by NP5103-Gal4 UAS-CD8GFP (Fig. 3a—c, and Supplementary 
Fig. 6a—c and the ‘odour’ condition in Supplementary Fig. 7d). VM2 PNs were 
recorded in flies in which the odorant receptor expressed by VM2 ORNs was 
mutated (NP5103-Gal4 UAS-CD8GFP; Or43b', see schematic in Fig. 3a). In 
some experiments for each genotype, we filled the recorded cell with biocytin 
and verified the PN identity with biocytin histochemistry as described 
previously*. In the GABA iontophoresis experiments in Fig. 3d-f (also 
Supplementary Figs 6d-f and 7), we recorded from PNs in the anterodorsal cell 
cluster? labelled by the enhancer trap line GH146 (GH146-Gal4 UAS-CD8GEFP). 
For the experiments in which pertussis toxin was selectively expressed in ORNs 
(Fig. 4, also Supplementary Fig. 8), we recorded from PNs in the anterodorsal cell 
cluster in the genotype Or83b-Gal4; UAS-PTX/+. In voltage-clamp recordings, 
the command potential was either —85 mV (Fig. 3a—c) or —65 mV (Figs 3d—f 
and 4). 

Manipulation of peripheral organs. Antennal input was abolished in some 
experiments by severing the antennal nerves with fine forceps just before record- 
ing. The antennal nerve was gently broken by applying forces perpendicular to 
the long axis of the nerve, leaving a stump of nerve attached to the antennal lobe. 
In other experiments, the maxillary palps were removed with forceps just before 
recording. For both antennal and palp amputation, the cell bodies of the affected 
ORNs are removed, but the proximal portions of their axons remain intact and 
continue to innervate the antennal lobe. Amputation removes ORN somata and 
thus large spontaneous spike-driven EPSCs (~5-20pA) disappear almost 
entirely. Axon terminals are still intact, as evidenced by the persistence of mini- 
ature EPSCs (~1 pA). In some experiments, the maxillary palps were shielded 
from direct odour stimulation by covering them in ‘5 minute epoxy’ (Devcon). 
The epoxy was allowed to dry for ~20 min before the recording was started. We 
confirmed that this substance is relatively non-toxic to ORNs by covering the 
antennae in epoxy, allowing it to dry, peeling the epoxy away, and verifying that 
odours were still able to elicit a normal field-potential response in the antennae. 
Olfactory stimulation. Odours were diluted in paraffin oil at a ratio of 1:100 v/v. 
Odour dilutions in paraffin oil were prepared fresh every 5 days. Our odour 
panel consisted of benzaldehyde, butyric acid, 1-butanol, cyclohexanone, ethyl 
butyrate, ethyl acetate, ethyl-3-hydroxybutryate, fenchone, geranyl acetate, 
2-heptanone, linalool, 4-methyl cyclohexanol, methyl salicylate, 3-methylthio- 
1-propanol, 4-methyl phenol, octanal, 1-octen-3-ol, pentyl acetate, trans-2- 
hexenal and paraffin oil (solvent control). Odour source details are posted at 
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http://wilson.med.harvard.edu/odors.html. Odours were delivered with a 
custom-built olfactometer as described previously'®. Odour stimuli were applied 
for 500 ms, except in Fig. 3 in which the odour was applied for 1s. For the 
experiments in Figs 1, 2 and 5, consecutive odour presentations were spaced 
25-35 s apart. 

Direct ORN axon stimulation. In Figs 3 and 4 and in Supplementary Figs 6-8, 
we electrically stimulated ORN axons. The ipsilateral antennal nerve was severed 
and inserted into a stimulating suction electrode. Single spikes were evoked in 
ORN axons with a short current pulse (50 1s) using a stimulus isolator 
(A.M.P.L). In each sweep we delivered a train of 23 nerve stimuli at 4 Hz. 
During the train, the size of the evoked EPSCs decreased over the first 4-8 stimuli 
owing to short-term depression, but then reached a steady value. We timed our 
GABA iontophoresis and odour stimulation to occur during this steady-state 
period. For the paired-pulse experiments in Supplementary Fig. 7, we delivered 
two electrical stimuli to the nerve with an inter-pulse interval of 25 ms. This 
interval produces depression at the ORN-PN synapse under control conditions 
and allowed us to observe increases in the paired-pulse ratio when we decreased 
release probability. 

GABA iontophoresis. In the GABA iontophoresis experiments, a high- 
resistance (~80.QM) sharp glass pipette was filled with a solution of 250 mM 
GABA in water, and the pH was adjusted to 4.3 by adding HCl. GABA was ejected 
into the antennal lobe neuropil with a brief (3-20 ms) positive current pulse 
using an iontophoresis unit (World Precision Instruments). A constant negative 
backing current was applied to retain GABA in the iontophoresis pipette between 
ejection events. To ensure that a similar amount of GABA was released into the 
neuropil in all experiments, and that the time-course of GABA release was as 
consistent as possible, we adjusted the location of the iontophoresis pipette, the 
level of negative backing current and the duration of the ejection current at 
the beginning of each experiment. We did not proceed with an experiment until 
the following criteria were met: EPSC suppression was ~50% at 150 ms after 
iontophoresis, maximal EPSC suppression was ~85% at 400 ms after iontophor- 
esis, and GABA evoked an outward current in the recorded PN which lasted for 
~1-1.5s. 

Antagonists. Antagonists were prepared as concentrated stock solutions and a 
measured volume of stock was added to the saline perfusate to achieve the final 
drug concentrations. Picrotoxin (Sigma) was used at 5 uM, CGP54626 (Tocris) 
was used at 50 UM, and CdCl, (Sigma) was used at 10-25 1M. 

Peri-stimulus time histograms. For the experiments in Figs 1 and 5 and in 
Supplementary Figs 2 and 3, spike times were extracted from raw ORN and 
PN recordings using routines in Igor Pro. Each cell was tested with multiple 
odours, and each odour was presented six times. The response to the first pre- 
sentation was not included in our analysis. Each of the five remaining trials was 
converted into a peri-stimulus time histogram (PSTH) by counting the number 
of spikes in 50-ms bins that overlapped by 25 ms. These single-trial PSTHs were 
averaged together to generate a PSTH describing the response to an odour in a 
given experiment. Multiple cells corresponding to each glomerular class and 
each cell type (ORN or PN) were tested with a given odour in multiple experi- 
ments. Average PSTHs in Fig. 1 and Supplementary Figs 2 and 3 represent the 
mean + s.e.m. computed across experiments. 

Input-output functions. For the analysis of odour responses in Fig. 1d, we 
computed the average spike rate over the 500-ms odour duration and subtracted 
the baseline firing rate averaged over the 500-ms window before odour onset. 
Each point in the input-output function represents the mean PN response to an 
odour plotted against the mean response of the cognate ORNs to the same 
odour. Fits are single exponential functions. 

Lifetime sparseness. The selectivity of a neuron’s odour response profile 
(Fig. le) was quantified as lifetime sparseness: 


where N = the number of odours and r; is the analogue response intensity of the 
neuron to odour j minus baseline firing rate. Analogue response intensity was the 
mean spike rate (averaged across five sweeps) during the entire 500-ms odour 
stimulus period. Any values of r;< 0 were set to zero before computing lifetime 
sparseness (this was the only analysis in this study in which negative responses 
were zeroed). Responses to paraffin oil (solvent control) were not considered in 
the sparseness analysis. We computed the lifetime sparseness for each individual 
cell in our study for which we tested at least 12 of the odours in our set. We used 
the Mann-Whitney U-test to assess the significance of sparseness differences. 

Correlations between ranked odour preferences. To compute correlations 
between ORN and PN odour preferences, we computed Spearman’s rank cor- 
relation coefficient (~) on the odour response vectors of these neurons. This is a 


©2008 Nature Publishing Group 


doi:10.1038/nature06864 


nonparametric measure that quantifies how well any monotonic function could 
describe the relationship between ORN and PN responses. We chose this rather 
than a linear (Pearson’s) correlation coefficient because the intrinsic input— 
output function of these glomeruli is highly nonlinear (Fig. 1d). For each glom- 
erulus, we computed Spearman’s p on each pairwise combination of individually 
recorded ORNs and PNs for which we tested at least 12 of the same odours on 
both cells (for glomerulus VM7, n = 27 comparisons for ORNs compared to PNs 
with antennae and n= 38 for ORNs compared to PNs without antennae; for 
glomerulus VC1, n= 19 for ORNs compared to PNs with antennae and n = 25 
for ORNs compared to PNs without antennae). We report the mean of these 
pairwise correlations. We assessed the significance of differences in Spearman’s p 
with the Mann-Whitney U-test. 

Quantifying changes in membrane potential. In Figs 2c and 5c, we show mem- 
brane potential responses averaged across multiple experiments. To generate 
these we first averaged 4-6 consecutive sweeps within a single experiment for 
each odour and low-pass filtered this at 13 Hz to remove any spikes. Next we 
averaged these responses across experiments, with + s.e.m. shown in pastel. To 
quantify the magnitude of lateral input in Fig. 2d-f, we integrated the change in 
membrane potential (versus baseline) over a 2-s interval beginning at odour 
onset. We chose this interval because shorter intervals did not capture as much 
odour dependence in the magnitude of lateral input. In Fig. 2d, f, PN disinhibi- 
tion was computed by averaging PN firing rates over the 2s after odour onset, 
and then averaging across experiments (4-12 PNs per odour), before computing 
the difference in the PN responses with antennae versus without antennae. In 
Fig. 2e, total ORN activity for each odour was computed by summing all the 
response rates in ref. 23. There are only 14 data points in this panel because only 
14 of our odours were included in ref. 23. 

EPSC amplitudes. We measured EPSC amplitudes in Figs 3 and 4 and in 
Supplementary Figs 6-8 by first averaging together a block of 8-20 sweeps from 
the same experiment. From these averaged traces, we computed the EPSC ampli- 
tude as the average over a 0.9-ms window centred on the peak. Before averaging 
across experiments, we normalized the train of evoked EPSCs to the baseline 
EPSCs evoked just before either odour or GABA onset. 
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Crystal structures of DNA/RNA repair 
enzymes AIkB and ABH2 bound to dsDNA 


Cai-Guang Yang'*, Chengai Yi'*, Erica M. Duguid’, Christopher T. Sullivan‘, Xing Jian’, Phoebe A. Rice? 


& Chuan He! 


Escherichia coli AlkB and its human homologues ABH2 and ABH3 repair DNA/RNA base lesions by using a direct 
oxidative dealkylation mechanism. ABH2 has the primary role of guarding mammalian genomes against 1-meA damage by 
repairing this lesion in double-stranded DNA (dsDNA), whereas AIkB and ABH3 preferentially repair single-stranded DNA 
(ssDNA) lesions and can repair damaged bases in RNA. Here we show the first crystal structures of AlkB-dsDNA and 
ABH2-dsDNA complexes, stabilized by a chemical cross-linking strategy. This study reveals that AlkB uses an 
unprecedented base-flipping mechanism to access the damaged base: it squeezes together the two bases flanking the 
flipped-out one to maintain the base stack, explaining the preference of AIkB for repairing ssDNA lesions over dsDNA ones. 
In addition, the first crystal structure of ABH2, presented here, provides a structural basis for designing inhibitors of this 


human DNA repair protein. 


Cellular DNA is constantly subjected to modifications by environ- 
mental and endogenous chemicals, which can result in covalent 
changes'”. Methylating (or alkylating) agents are a common group 
of DNA modifiers that introduce damage primarily to the hetero- 
cyclic bases of DNA, with mutagenic and/or cytotoxic consequences. 
Alkylating agents are also widely used in cancer therapy and exert 
anticancer effects by creating cytotoxic DNA lesions in tumour cells. 
Many of these alkylation DNA damages are detected and repaired by 
proteins that are conserved across kingdoms. 

The E. coli AlkB protein is a direct dealkylation DNA repair 
protein*. It uses a mononuclear iron(II) site and cofactors 
2-ketoglutarate (2KG) and dioxygen to perform an unprecedented 
oxidative demethylation of DNA base lesions 1-meA, 3-meC, 1-meG 
and 3-meT (Supplementary Fig. 2)°"'. AlkB also removes etheno 
DNA lesions by using a similar oxidation mechanism'”"’. There are 
nine potential human homologues of AlkB. Two of these, ABH2 and 
ABH3, can repair the same spectrum of base lesions as AlkB'*"*. A 
third one, FTO, has been recently shown to repair 3-meT in DNA, 
and it affects obesity in mammals through an unknown mech- 
anism'’. ABH2 acts as the primary housekeeping enzyme in mam- 
mals for repairing endogenously formed 1-meA lesions in duplex 
DNA". The in vivo function of ABH3 is still unclear but it has been 
shown to repair base lesions in RNA as well as DNA". Although the 
structures of AlkB (with dT-(1-medA)-dT)'? and ABH3 (in the 
absence of DNA or RNA)” were solved recently, the molecular mech- 
anism underlying the different substrate preferences of ABH2 and 
AlkB remains unknown. Furthermore, the structure of ABH2 has, to 
our knowledge, never been reported. 


Cross-linking to stabilize protein-DNA complexes 

We report here the first crystal structures of AlkB—dsDNA and 
ABH2-dsDNA complexes. The AlkB family proteins bind DNA 
weakly”! and form labile complexes with damage-containing DNA”, 
which makes crystallization of their protein-DNA complexes challen- 
ging. To overcome this difficulty we used chemical cross-linking 


methods”; initially using an active site disulphide cross-linking 
strategy that we developed previously (Fig. la)”*’®. Base-repair 
proteins flip damaged bases and insert them into the active site for 
processing. Therefore, we reasoned, a cysteine residue engineered into 
the active site of AIkB may form a disulphide cross-link, at equili- 
brium, with a disulphide-modified cytosine (C* in a C*:A base pair) 
flipped into the active site of the repair protein (Fig. 1a)”’. Of several 
mutations tested, AlkB D135C and ABH2 E175C gave the best cross- 
linking yields with disulphide-modified DNA under equilibrium 
conditions (Supplementary Fig. 3). 


Structure determination 

For crystallization, truncated but catalytically active AlkB-AN11"” 
and ABH2-AN55 were used (Supplementary Fig. 4). Of multiple 
DNA sequences screened, a 13-base-pair duplex DNAI1, with a 
central C*:A base pair (Fig. 1a), gave the best crystals of the active 
site cross-linked AlkB—DNA complex. These crystals diffracted to 
1.65 A and the structure was solved by molecular replacement using 
the previously reported AlkB structure as the search model’. Two 
crystal forms of active site cross-linked ABH2—DNA complexes were 
grown using different 13-base-pair dsDNAs containing central C*:A 
pairs. The first, with DNA2, diffracted to 2.0 A, and was solved by 
multi-wavelength anomalous dispersion (MAD) phasing using 
bromine-labelled (5-Br dU) DNA. The second form, with DNA3, 
diffracted to 3.0A and was solved by molecular replacement using 
the first as a search model. Both the AlkB and the ABH2 truncations 
used in this study still efficiently repair 1-meA in dsDNA with the 
sequences shown in the crystal structures (Supplementary Figs 5 
and 6). 


General features of the protein-DNA interactions 

AlkB and ABH2 have similar folds (a common jelly-roll fold that 
is shared by all iron(II)/2-KG-dependent oxygenases**”°), and both 
flip the damaged base into a deep active site pocket. However, their 
interactions with dsDNA are very different (Fig. 1b-e). First, AIkB 
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interacts almost exclusively with the damaged strand, whereas ABH2 
makes significant contacts with both DNA strands (Supplementary 
Fig. 7). Second, the two enzymes use very different mechanisms to flip 
the damaged base. ABH2 uses an aromatic finger residue, Phe 102, 
which resembles the intercalating hairpin motif observed with DNA 
glycosylases*’’, to intercalate into the duplex stack and fill the DNA 
gap resulting from the base flipping. AlkB lacks this finger residue; 
instead it manipulates the DNA backbone near the flipped base such 
that the bases flanking it stack with one another. Two slightly different 
DNA conformations were observed in the AIkKB—DNA1 structure, but 
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i 4 Cys 
Disulphide s~ 
cross-linking \ ‘s 
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1 
Rona 
c ABH2-DNA2 


Figure 1| Crystal structures of the active site cross-linked AIkB-DNA1 and 
ABH2-DNA2 complexes. a, The active site cross-linking strategy. 

A disulphide-modified cytosine C* (n = 1 or 2) in a mismatched C*:A base 
pair can be flipped into the active site of AIkB or ABH2. A covalent 
disulphide bond can form between C* and an engineered Cys in the active 
site of the repair protein under equilibrium conditions. b, Cartoon of the 
AlkB-DNA1 complex. Two slightly different DNA conformations are 
observed in this structure; one is shown here, the other in Supplementary 
Fig. 8. The protein is shown in green (the cross-linking residue Cys 135 is 
labelled), manganese(II) in orange, 2KG in blue, DNA in yellow-orange, the 
flipped base C8* and the two bases flanking C8* in light magenta, the 
disulphide bond in red and the orphaned base A8’ in cyan. The approximate 
location of the disordered base T7’ is shown as a dotted line. ¢, Cartoon trace 
of the ABH2—DNA2 complex. The same colour coding as in b is used. The 
finger residue Phe 102 and the cross-linking residue Cys 175 are green, and 
the orphaned base A7’ is cyan. The location of a DNA-binding loop 
containing a RKK sequence is indicated. d, e, Same structure as in b and 

c, rotated 90° to the right. A DNA-binding flexible long loop is labelled. See 
Supplementary Fig. 15 for a topological diagram of ABH2. 
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in both cases a similar distortion of the lesion-containing strand was 
observed (Fig. 1b, d, and Supplementary Fig. 8). 


Cross-linking AIkB to dsDNA away from the active site 

The protein portion of the AIkB—DNA1 structure overlaps almost 
perfectly with the previously published AIkB structure (Supplemen- 
tary Fig. 9a). However, to confirm that the DNA distortions observed 
are not due to cross-linking to the active site of the protein, we also 
engineered a disulphide cross-link away from the active site, to AIkB 
S129C (Fig. 2a)***’. Two such AlkB—dsDNA complexes were crystal- 
lized: one with DNA4, a 13-base-pair dsDNA containing C10*, 1-meA 
and an abasic site (Fig. 2b); and one with DNAS, containing C10* two 
base pairs away from the 1-meA (Fig. 2c). These were solved at 1.68 A 
and 1.90 A, respectively, by molecular replacement. 


Unique binding of duplex DNA by AIkB 


All three structures of the AlkB-dsDNA complexes show a base- 
flipping feature distinct from all known structures of base-flipping 
proteins bound to DNA. The two bases that flank the flipped 1-meA, 
A7 and A9, are ‘squeezed’ such that they stack on one another, about 
3.4A apart (Fig. 3a and Supplementary Fig. 10c). The sugar rings of 
these two nucleosides adopt unusual conformations. One (AQ) is 
C3'-endo, standard in A-form DNA (Supplementary Fig. 10a). The 
other (A7) shows the most surprising feature: with its base, it is 
twisted by about 180°, resulting in an inverted sugar (O4’ points 
towards the 3’ end of the DNA strand) while the base stacks with 
A9:T9' (Fig. 3a). The inversion of this sugar ring is locally enforced by 
residues 51-53 (Thr-Pro-Gly), which form a rigid hydrogen bonding 
network with the phosphate group between A7 and A6 (Fig. 3a and 
Supplementary Fig. 10b), and through multiple protein-DNA inter- 
actions anchoring the two phosphate groups flanking the 1-meA 
(Supplementary Fig. 7a). These features were not observed in the 
AIkB structure containing a short T-(1-meA)-T’’, most likely 
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Figure 2 | Crystal structures of AlkB-DNA4 and AlkB-DNAS complexes 
with 1-meA recognized by an intact active site. a, A strategy to set up a 
disulphide cross-link away from the active site of AlkB. b, Cartoon of the 
AlkB—DNA4 complex with 1-meA (light magenta) recognized in the active 
site of AlkB. The complex was stabilized by a disulphide link between an 
engineered Cys 129 (green) and C10* that is two base pairs away from 
1-meA. The colour coding is the same as in Fig. 1b. DNA4 has an abasic site 
‘LY (position 7') 3’ to T8’ of the 1-meA:T base pair. ¢, Cartoon trace of the 
AlkB—DNA5 complex; DNAS has a central 1-meA:A base pair replacing a 
C*:A in DNAI and a C10* cross-linking site two base pairs away from 
1-meA. 
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because the ssDNA trimer is too short to reach much of the DNA- 
binding surface of AIkB. Asa result, the positions of the two bases that 
flank the 1-meA are different in the previous AIkB structure and the 
current AlkB—dsDNA structures (Supplementary Fig. 9c). It remains 
to be seen how a long ssDNA interacts with AlkB. 

In all three AlkB—dsDNA structures, the protein stabilizes only the 
distorted conformation of the DNA immediately flanking the 1-meA; 
the complementary strand appears to accommodate this through 
spontaneous conformational rearrangements. The base opposite 
the 1-meA (T8’ in DNA4 or A8’ in DNAI1 and DNAS) intercalates 
between A7 and A6 to generate a one-base-wide stack containing 
bases from both strands, which maintains the length of the normal 
duplex DNA (Fig. 3a and Supplementary Fig. 10c). The base at the 7’ 
position is forced out of the helix and is disordered in our structures 
(in DNA4 it is replaced with an abasic site) (Figs 1b and 2b, c). To 
support the observation that the complementary strand distorts 
to accommodate base flipping in AlkB—dsDNA complexes, we 
measured the context dependence of AlkB-mediated 1-meA repair 
(Supplementary Fig. 11). AlkB can catalytically repair this lesion in 
the dsDNA sequence used in our crystals (replacing C8*:A8’ with 
l-meA:T in DNA1), which has two A:T base pairs flanking 
1-meA:T. When the two flanking base pairs (7:7’ and 9:9’) were 
replaced with G:C, repair activity dropped approximately threefold. 
The activity of this substrate was restored to the original level by 
incorporating an abasic site at position 7’, which destabilizes the base 
pair containing the base that is forced out of the double helix in our 
structures. The highest catalytic repair activity was observed with the 
ssDNA strand containing 1-meA, as expected. 


ABH2 binding to duplex DNA 


The structure of ABH2 differs from AlkB in having extra DNA- 
binding motifs that grasp the complementary strand of the duplex 
DNA (Fig. 1). ABH2 uses a short loop with a positively charged RKK 
sequence (Arg 241-Lys 243) and an additional long, flexible loop 
carrying DNA-binding residues Arg 198, Gly 204 and Lys 205 to 
bind the opposite DNA strand (Fig. lc, e and Supplementary Fig. 
7b). The two bases that flank the flipped out C7*, A8 and T6, are 
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Figure 3 | Close views of the base-flipping regions. a, AIkB—DNA4 
complex, with the same colouring as in Fig. 1b. A7 stacks with the A9:T9’ 
base pair. Thr 51 and Gly 53 are hydrogen bonded to the phosphate linking 
A7 and A6 (Supplementary Fig. 10b). The sugar ring of A9 adopts the 3’- 
endo conformation whereas the sugar ring of A7 is forced to be inverted by 
about 180° by the protein. T8’ from the complementary strand intercalates 
between A7 and A6, forming a one-base-wide stack containing bases from 
both strands. b, ABH2—DNA2 complex, with the same colour coding as in 
Fig. 1b. Phe 102 inserts into the DNA duplex with A7’ as the orphaned base. 
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approximately the same distance from each other as they would be in 
canonical duplex DNA (Fig. 3b). The orphaned base A7’ is flexible 
and poorly ordered. Because the normal duplex DNA length is main- 
tained by the intercalating finger Phe 102, no further distortion of the 
DNA duplex is induced by ABH2. 

To capture the structure of ABH2 with an intact active site, we 
engineered a disulphide cross-link away from the active site, to ABH2 
G169C***, The cross-linked complex with a new 13-base-pair 
duplex probe containing C9* (DNA6; Fig. 4a and Supplementary 
Fig. 12) was crystallized and the structure was solved at 2.35 A by 
molecular replacement. A cofactor-containing structure was also 
determined (at 2.6 A) by soaking these crystals in 2KG and Mn** 
(which occupies Fe**-binding site but does not support catalysis). 
The cofactors are bound by ABH2 ina standard conformation resem- 
bling that seen in AlkB and other iron(II)/2KG-based oxygenases’*”®. 
The metal ion is ligated by His 171, Asp 173 and His 236, a bidentate 
2KG and a water molecule in an octahedral geometry (Fig. 4b, and 
Supplementary Figs 13 and 14). 


Recognition of 1-meA by ABH2 

The ABH2-dsDNA6 structure provides an undistorted view of the 
active site (Fig. 4b). A total of five active site residues are engaged in 
the recognition of 1-meA. Phe 124 and His 171 stack against the 
1-meA; an analogous stacking interaction is also observed for 
1-meA recognition by Trp 69 and His 131 in AlkB (Supplementary 
Fig. 9c). Three residues, Tyr 122, Glu 175 and Asp 174, together with a 
water molecule, form an extensive hydrogen-bonding network that 
contacts the N6 and N7 nitrogen atoms of 1-meA in ABH2. This tight 
recognition of 1-meA is not observed in the AlkB structure, as only 
two residues, Asp 135 and Glu 136 (mediated through a water mole- 
cule), were involved in the recognition of the N6 nitrogen of 1-meA 
by AlkB'’. The positively charged 1-meA may be favourably recog- 
nized through electrostatic interaction with the nearby carboxylate of 
Glu 175 (Asp 135 of AlkB) and stacking with the m system of Phe 124 
(Trp 69 of AlkB). These features may help ABH2 and AIkB discri- 
minate against the neutral base A. 


Discussion 


The AIkB family oxygenases represent a novel class of DNA base- 
repair enzymes that use an oxidative dealkylation mechanism. 
Certain members of this family, AIkB and ABH3, exhibit a distinct 
preference to repair ssDNA and RNA. We present the first structures 
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Figure 4 | Crystal structure of the ABH2-DNA6 complex with 1-meA 
recognized by an intact active site. a, Cartoon trace of the ABH2-DNA6 
complex with 1-meA recognized in the active site of ABH2 and a disulphide 
cross-link (Cys 169 to C9*) installed two base pairs away from the flipped 
base. The same colour scheme is used as in Fig. 1b. b, View of the active site of 
ABH2 with manganese(II) (orange), 2KG (blue) and 1-meA (light magenta). 
N6 of 1-meA is within hydrogen-bond distance to Tyr 122 (3.3 A) and Glu 
175 (2.8 A). N7 forms a hydrogen bond to a water (3.0 A) which also interacts 
with Asp 174 (2.6 A) and Glu 175 (2.6 A). 
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of AlkB—dsDNA and ABH2-dsDNA complexes stabilized by chemi- 
cal cross-linking, showing how AIkB and ABH2 bind and repair 
damage in dsDNA. The AlkB protein, lacking a finger residue, facili- 
tates base flipping by squeezing together the two bases flanking the 
flipped-out one such that they stack on one another. This AlkB- 
mediated DNA distortion, together with the fact that AlkB mostly 
binds to the lesion-containing strand of DNA, explains the preference 
of AlkB for ssDNA substrates. The presence of the complementary 
strand in dsDNA rigidifies the duplex structure and serves as a ‘non- 
competitive’ inhibitor. The binding energy of complex formation 
must be used to pay an energetic penalty to flip 1-meA from a duplex 
DNA. A much lower energetic penalty would be associated with AlkB 
binding to the same base lesion in flexible ssDNA. 

The ABH2-DNA structures clearly show ABH2 as a dsDNA repair 
protein, as the protein interacts extensively with both strands of 
dsDNA and uses a finger residue to intercalate in the DNA duplex 
to maintain continuous stacking. It has been commonly observed 
that single-base DNA glycosylases tend to contact only the lesion- 
containing strand of duplex DNA*'***°. The extra capacity to interact 
with the other strand of DNA allows ABH2 to repair dsDNA lesions 
preferentially, which correlates well with its primary role as an 
enzyme that repairs 1-meA damage in mammalian genomes. Our 
structures also reveal a more complex arrangement of protein- 
(1-meA) interactions in the active site of ABH2 than AlkB, which 
provides a structural basis for designing small molecules that may 
shut down the ABH2-based repair activity potentially to improve 
efficacy of alkylation anticancer treatments. 

Now with structures of AlkB—dsDNA, ABH2—dsDNA and ABH3 
available, comparisons can be made (Fig. 5 and Supplementary Fig. 
15). Surprisingly, ABH3, which shows substrate preferences similar 
to AlkB, shares some of the DNA-binding motifs of ABH2 in an 
overlay of these three structures. The long loop that binds the second 
DNA strand in ABH2 (between B11 and B12) is present in the 
sequence though disordered in the structure of ABH3 (between 
B10 and B11), but is much shorter in AlkB. Both ABH2 and ABH3 
also contain a B-sheet hairpin motif (B3—B4 and B4—B5, respectively) 
that is absent in AlkB. In ABH2 this hairpin contains the aromatic 
finger residue, Phe 102, which is not conserved in ABH3. ABH3 also 
lacks the RKK motif used by ABH2 to contact the complementary 
strand of double-stranded DNA. How ABH3 may use these motifs to 
bind ssDNA or RNA preferentially remains to be seen in future 
structures of ABH3—DNA complexes. 

Both AlkB and ABH2 are weak DNA-binding proteins. They likely 
interact with other DNA-binding proteins to stay close to DNA. 
ABH2, with a finger residue, may search for DNA damage by similar 
mechanisms proposed for DNA repair glycosylases*’**. AlkB, using a 
very different base-flipping mechanism, may impose a strain to the 


Figure 5 | Structural comparison of AlkB, ABH2 and ABH3 (stereo view). 
Least-squares superposition of structures of AlkB (in magenta, from 
AIkB-DNAI without showing DNA1), ABH2—DNA2 (protein in green and 
DNA backbone in orange) and ABH3 (in blue, from Protein Data Bank 
21UW). The hairpin and flexible long-loop motifs are shaded, and the finger 
residue Phe 102 from ABH2 is labelled in green. 
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backbone of duplex DNA to facilitate identification of weakened base 
pairs, as suggested by the structures presented here. However, how 
does it locate damaged bases in flexible ssDNA? Further studies are 
required to answer these intriguing questions. 

Lastly, the active site cross-linking strategy to trap labile protein— 
DNA complexes could be powerful for structurally characterizing 
other base-flipping proteins such as FTO and ABH3. As demon- 
strated here, once detailed protein-DNA interactions are revealed, 
a cross-link away from the active site can be readily engineered to 
provide native-like structures of the protein-DNA complexes. 


METHODS SUMMARY 


The disulphide cross-linked protein-DNA complexes were purified using 
Mono-Q anion exchange chromatography, and crystals were grown by hanging- 
drop vapour diffusion. Diffraction data were collected from cryo-preserved 
crystals at beamlines 19BM and 14BM at the Advanced Photon Source, 
Argonne National Laboratory, Argonne, Illinois. The structures of AlkB— 
dsDNA complexes were solved by molecular replacement, and the ABH2- 
dsDNA structure was solved by MAD phasing with a bromine-labelled DNA. 
Data collection and refinement parameters for all structures are given in 
Supplementary Tables 1 and 2. Detailed procedures are presented in 
Supplementary Information. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Oligonucleotide synthesis. Oligonucleotides containing disulphide-tethered 
cytosine were prepared by incorporating the O4-triazolyl-dU-CE phosphorami- 
dite (Glen Research, Inc.) at the desired position during solid-phase synthesis”. 
All synthetic oligonucleotides were purified with denaturing polyacrylamide gel 
electrophoresis or reverse-phase high-performance liquid chromatography. 
Cross-linking, purification and crystallization of the AlkB—dsDNA com- 
plexes. A truncated AlkB with deletion of the amino (N)-terminal 11 amino 
acids'” was cloned into a pET30a vector (Novagen) and overexpressed in E. coli 
BL21(DE3). The protein was purified following a previously described proce- 
dure*'. The Asp 135 to Cys mutation was introduced using the QuikChange 
Site-Directed Mutagenesis Kit (Stratagene). Synthetic oligonucleotides (DNA1: 
5'-TAGGTAAC*ATCGT-3’ and its complement 5’-AACGATATTACCT-3’, 
where C* is a disulphide-tethered cytosine) were prepared, purified and cross- 
linked to the D135C mutant AIkB as described”. To cross-link away from the 
active site, a similar procedure was applied to the $129C mutant of AlkB and 
DNAS (5'-TAGGTAA(1-meA)AC*CGT and its complement 5’-AACGGTAT- 
TACCT-3’ (the 1-meA phosphoramidite from ChemGenes, Inc.)) and DNA4 
(5'-TAGGTAA(1-meA)AC*CGT paired with 5’-AACGGTTUTACCT-3’, 
where ‘ stands for an abasic site). The covalently linked AlkB-dsDNA com- 
plexes were purified using Mono-Q anion exchange chromatography (GE 
Healthcare), after which the buffer was exchanged to 100 mM NaCl, 10mM 
Tris-HCl (pH 7.4), and the complexes were concentrated to 12-15mgul'. 
The complexes were crystallized by the hanging drop method at 4°C. MnCl, 
(1.0 mM) and 2KG (2.0 mM) were added to the complexes, and they were mixed 
in a 1:1 ratio with well solution containing 100mM NaCl, 25mM MgCl, 
100mM cacodylate (pH 6.5) and 20% w/v PEG 8K. Crystals appeared after 
2-4 days and were allowed to grow for several weeks. They were transferred into 
a cryo-protectant solution containing the reservoir solution plus 20% glycerol, 
and frozen in liquid nitrogen for X-ray data collection. 

Cloning, expression and purification of ABH2-AN55. Limited protease diges- 
tion with trypsin and sequence alignment of ABH2 with E. coli AlkB and ABH3 
led us to design a new construct of ABH2 lacking the first 55 N-terminal amino 
acids. The abh2-AN55 gene was cloned between the Ndel and HindIII sites of a 
pET28a+ vector (Novagen). This plasmid was transformed into BL21 (DE3) E. 
coli cells (Stratagene) for protein overexpression. The protein was purified using 
Ni-NTA chromatography (GE Healthcare) with an elution buffer of 50mM 
sodium phosphate (pH 8.0), 300mM NaCl and 400mM imidazole. The 
N-terminal His-tag was removed by an overnight thrombin digestion at 4°C 
and the tag-free protein was purified the next day with another round of Ni-NTA 
chromatography. About 4-6 mg of protein could be obtained from 11 of bac- 
terial cells. All other mutants of ABH2-AN55 used in this study were generated 
by using QuikChange Site-Directed Mutagenesis Kit and expressed and purified 
by the same method. 

Crystallization of the ABH2—dsDNA complexes. Three additional mutations, 
C67S, C165S and C192S, were introduced into ABH2-AN55 E175C to avoid 
non-specific disulphide cross-linking. The mutant protein was used for cross- 
linking with synthetic oligonucleotides DNA2 (5'-CTGTATC*ATTGCG-3’' 
paired with 5'-TCGCAATAATACA-3’) for the complex that gave the high- 
resolution structure, and DNA3 (5'-AGGTGAC*AATGCG-3’ paired with 5’- 
TCGCATTATCACC-3’) for the complex that afforded the low-resolution struc- 
ture. An ABH2-AN55 G169C mutant was cross-linked to DNA6 (5'- 
CTGTAT(1-meA)AC*TGCG-3’ paired with 5'-TCGCAGTTATACA-3') to 


nature 


yield the complex that afforded the structure of a disulphide cross-link away 
from the active site. The covalently linked ABH2—dsDNA complexes were puri- 
fied using Mono-Q anion exchange chromatography, after which the buffer was 
exchanged to 100mM NaCl, 10mM Tris-HCl (pH 8.0), and the complexes 
concentrated to 5mg ml _'. Hexagonal rod-shaped crystals grew in 2-4 weeks 
at 4°C, in hanging drops containing 1 ul of complex solution and 1 pl of res- 
ervoir solution of 100 mM NaCl, 50 mM MgCl and 100 mM cacodylate (pH 6.5) 
and 8% 8K PEG. Crystals were transferred to a cryoprotectant solution of res- 
ervoir solution plus 24% glycerol, and were frozen in liquid nitrogen for data 
collection. Cofactor-containing ABH2—DNA2 crystals grew in 2—4 days at room 
temperature, in hanging drops containing ll of complex solution 
(5.0 mg ml ', 100mM NaCl, 10mM Tris-HCl, pH 7.4, 1.0mM MnCh, 
2.0mM 2KG) and 1 ul of reservoir solution of 100 mM NaCl, 50mM MgCl 
and 100 mM cacodylate (pH 6.1) and 16% PEG 4K. MnCl, (2.5mM) and 2KG 
were soaked into the ABH2—DNA6 crystals for 4.5h to give (Mn/2KG)ABH2-— 
DNA6 crystals. 

Structure determination and refinement. The AIkKB—DNA complex structure 
was phased by molecular replacement (using EPMR” and Phaser**), using the 
previously published AIkB structure as a search model. The model was built by 
using COOT” and refined with the program REFMACS from the CCP4 suite*®. 
To obtain phase information for the ABH2—DNA2 complex structure, 5-BrU- 
containing oligonucleotide (DNA2: 5’-CTGXATC*AXXGCG-3’ annealed to 5'- 
TCGCAAXAAXACA-3’, where X stands for 5-Br dU) was used to grow crystals 
with ABH2. A two-wavelength anomalous diffraction data set was collected at 
wavelengths of 0.9192A (peak) and 0.9199 A (inflection) at beamline 19BM at 
the Structure Biology Center at the Applied Photon Source. All X-ray data were 
integrated and scaled using HKL2000 program suite“! and converted to structure 
factors with the CCP4 program. The positions of five bromine atoms from the 
MAD data were located and the initial electron density map was calculated using 
the program SOLVE”. Refmac5 was used to refine the model against the 2-A 
resolution native data. An initial model was built into the electron-density map 
manually in COOT. The structure was completed using iterative rounds of 
manual model building in COOT and computational refinement in either 
CNS* or CCP4. Phases for all other ABH2 data sets were determined by molecu- 
lar replacement using Phaser**. Data collection and refinement parameters for all 
structures are given in Supplementary Tables 1 and 2. Molecular graphics figures 
were prepared with PyYMOL™. 
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The inner jet of an active galactic nucleus as revealed 
by a radio-to-y-ray outburst 


Alan P. Marscher', Svetlana G. Jorstad’”, Francesca D. D’'Arcangelo’, Paul S. Smith®, G. Grant Williams’, 
Valeri M. Larionov’, Haruki Oh't, Alice R. Olmstead’, Margo F. Aller, Hugh D. Aller?, lan M. McHardy’®, 
Anne Lahteenmaki’, Merja Tornikoski’, Esko Valtaoja®’, Vladimir A. Hagen-Thorn*, Eugenia N. Kopatskaya’, 
Walter K. Gear'°, Gino Tosti’!, Omar Kurtanidze'*, Maria Nikolashvili’”, Lorand Sigua'*, H. Richard Miller’? 


& Wesley T. Ryle’? 


Blazars are the most extreme active galactic nuclei. They possess 
oppositely directed plasma jets emanating at near light speeds 
from accreting supermassive black holes. According to theoretical 
models, such jets are propelled by magnetic fields twisted by dif- 
ferential rotation of the black hole’s accretion disk or inertial- 
frame-dragging ergosphere’’. The flow velocity increases outward 
along the jet in an acceleration and collimation zone containing a 
coiled magnetic field**. Detailed observations of outbursts of elec- 
tromagnetic radiation, for which blazars are famous, can poten- 
tially probe the zone. It has hitherto not been possible to either 
specify the location of the outbursts or verify the general picture of 
jet formation. Here we report sequences of high-resolution radio 
images and optical polarization measurements of the blazar BL 
Lacertae. The data reveal a bright feature in the jet that causes a 
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Figure 1| Sequence of Very Long Baseline Array images of BL Lac at a 
wavelength of 7 mm (and a frequency of 43 GHz). The images are 
convolved with a circular gaussian function (shown in the bottom-left 
corner) that has a full width at half maximum of 0.1 mas, the resolution of 
the longest baselines of the array, which corresponds to 0.12 pc at the 
distance of 291 Mpc derived from the redshift, z = 0.069, and the Hubble 
Law with an assumed slope of Hyp = 72kms_ | Mpc! (ref. 22). The vertical 
axis shows angular distance relative to the core. The contours indicate total 


double flare of radiation from optical frequencies to TeV y-ray 
energies, as well as a delayed outburst at radio wavelengths. We 
conclude that the event starts in a region with a helical magnetic 
field that we identify with the acceleration and collimation zone 
predicted by the theories. The feature brightens again when it 
crosses a standing shock wave corresponding to the bright ‘core’ 
seen on the images. 

The jet of BL Lac (Fig. 1) approaches us within 6—10° of our line of 
sight at a flow speed of 0.981—0.994c (where c is the speed of light), 
corresponding to a Lorentz factor of 7.0 + 1.8 (ref. 6). Relativistic 
aberration and the Doppler effect strongly beam the radiation, result- 
ing in the apparent luminosity being hundreds of times higher than it 
would be if the emitting plasma were at rest. An essentially identical 
counterjet is presumably present, but too faint to detect because of 


colour scale represents polarized intensity, and peaks (yellow) at 0.215 Jy per 
beam. The yellow line segments indicate the electric vector position angle 

(EVPA) of the polarization, which has an uncertainty of +7° in each image. 
The EVPA has been rotated by 16° in the clockwise direction to compensate 
for Faraday rotation; see the Supplementary Information. The approximate 
path of the centroid of the knot, the definition of which includes a point from 
an image obtained on 12 March 2006 in addition to those shown, is given by 
the slanted red line. The knot’s observed proper motion of 1.2 mas yr‘ is 


equivalent to an apparent speed of 5.0c after correction for time dilation due 
to cosmological redshift. 


flux intensity, starting at 0.023 Janskys (Jy) per beam 
(1Jy=10 ergs ‘cm 7Hz ') and increasing in factors of two. The 


‘Institute for Astrophysical Research, Boston University, 725 Commonwealth Avenue, Boston, Massachusetts 02215, USA. *Astronomical Institute of St Petersburg State University, 
Universitetskij Prospect 28, Petrodvorets, 198504 St Petersburg, Russia. *Steward Observatory, “MMT Observatory, University of Arizona, Tucson, Arizona 85721-0065, USA. 
°Astronomy Department, University of Michigan, 830 Dennison Building, Ann Arbor, Michigan 48109-1090, USA. “Department of Physics and Astronomy, University of 
Southampton, Highfield, Southampton SO17 1BJ, UK. 7Metsahovi Radio Observatory, Helsinki University of Technology TKK, Metsadhovintie 114, 02540 Kylmala, Finland. ®Tuorla 
Observatory Vaisalantie 20, 21500 Piikkid, Finland. 7Department of Physics, University of Turku, 20014 Turku, Finland. '°School of Physics and Astronomy, Cardiff University, Cardiff 
CF24 3YB, UK. "Department of Physics, University of Perugia, Via A. Pascoli, 06123 Perugia, Italy. '’Abastumani Astrophysical Observatory, Mt Kanobili, Abastumani, Georgia. 
Department of Physics and Astronomy, Georgia State University, Atlanta, Georgia 30303, USA. +Present address: Department of Physics, University of California, Berkeley, 
California 94720-7300, USA. 


966 
©2008 Nature Publishing Group 


NATURE|Vol 452|24 April 2008 


beaming in the opposite direction. The stationary core lies at the 
northern end of the visible jet. Bright ‘knots’ emerge from the core 
at a rate of 1-2 per year and move south at apparent superluminal 
speeds, an illusion caused by their relativistic motion’. 

The radio, optical, and X-ray light curves in Fig. 2 indicate a double 
flare in late 2005. The highly significant detection’ of >0.2 TeV y-rays 
from 2005.819 to 2005.831 during the first X-ray flare implies that 
acceleration of electrons with sub-TeV energies was particularly effi- 
cient at this time. These electrons can both produce X-rays from 
synchrotron radiation and scatter the X-ray photons to GeV y-ray 
energies that are boosted to the TeV range by relativistic motion of 
the jet plasma. The location of such flares has been controversial: 
some observations*” indicate that they occur downstream of the core, 
whereas most theoretical models require that they take place well 
upstream of this region, where the plasma is more compact. As we 
explain below, our data indicate that the first flare in late 2005 corre- 
sponds to a disturbance passing through the zone upstream of the 
core, where the jet flow is still accelerating, and that the second occurs 
as the disturbance crosses a standing shock system in the core. 

The identification of the location of the initial flare within the 
acceleration and collimation zone is significant, since previous obser- 
vations of jet collimation are quite limited. For example, an image’ 
7-mm wavelength of the radio galaxy M87 appears to reveal an ini- 
tially broad outflow that narrows into a nearly cylindrical jet. This is 
consistent with gradual collimation by either a toroidal magnetic 
field* or external confining gas pressure that decreases with distance 
from the black hole'’. The flow seen in M87 could include a ‘sheath’ 
that moves more slowly and is less focused than the ‘spine’!’. In the 
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case of BL Lac, the high apparent superluminal motions of bright 
knots in the jet and the pronounced variability at all wavelengths 
imply that the observed radiation arises exclusively from the spine, 
where special relativistic effects dominate. 

The primary observational indicator of magnetic collimation 
requiring a coiled magnetic field in the spine of the jet is the evolution 
of the polarization. When observed at an angle to its axis, synchro- 
tron radiation from a circularly symmetric jet with a helical field 
displays a net polarization oriented either parallel or perpendicular 
to the projected jet axis'*. Such parallel and perpendicular polariza- 
tions can be confused with shock waves and velocity shear, respec- 
tively, which can produce the same polarization patterns. However, 
in a model where magnetic forces gradually accelerate and focus the 
jet, the flow velocity is directed along streamlines that follow a helical 
trajectory with a different, wider, pitch angle than that of the mag- 
netic field*. The rotation of the flow can be traced back to the base of 
the jet in the orbiting accretion disk or differentially rotating ergo- 
sphere, where the spin of the black hole drags the inertial frames. A 
shock wave or other compressive feature propagating down the jet 
traces a spiral path that follows a streamline and cycles through the 
orientations of the helical field (see Fig. 3 and ref. 5). This should 
manifest itself as a rotation of the position angle of linear polarization 
as the feature moves outward. The degree of polarization should drop 
to a minimum in the middle of the rotation, when the mean magnetic 
field in the flaring region is transverse to that of the previously exist- 
ing emission". As Fig. 2g, h demonstrates, we see both effects. 

The optical EVPA shown in Fig. 2g rotates steadily by about 240° 
over a five-day interval before settling at a value of ~195°. The 
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Figure 2 | Flux density at various wavebands and optical polarization of BL 
Lac, as functions of time. a—d, Dependence on time of the flux of radiation 
from BL Lac over a two-year interval at the indicated wavebands. The X-ray 
measurements in a are of photon energy flux F integrated over photon 
energies of 2-10 keV. Error bars represent +1 s.d. uncertainties in the values 
plotted. The exponent of the power-law dependence of X-ray flux density on 
frequency is denoted by —«,. e-h, Enlargements of the 0.25-yr time interval 
marked by vertical dotted lines in panels a—d, but with optical R-band EVPA 
(g) and degree of polarization P (h) respectively replacing X-ray spectral 
index (b) and radio flux density (d) (whereas e and f respectively show the 
magnified intervals in a and c). Error bars represent +1 s.d. The interval of 
highly significant detections’ at photon energies >0.2 TeV is indicated by 
the width of the head of the arrow in e. The rotation in optical R-band EVPA 
near the time of the peak of the first optical and X-ray flare is apparent. 
Because there is an ambiguity of +180° in the value of the EVPA, we have 
selected the quadrant of each value that provides a consistent overall trend of 
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rotation between 2005.81 and 2005.83. The solid curve in g corresponds to 
the pattern predicted by the model shown in Fig. 3 when relativistic 
aberration is included. The vertical arrow (with error bar) in h indicates the 
time at which the superluminal knot is coincident with the stationary core 
seen in the images displayed in Fig. 1. Optical polarimetric data were 
obtained from Steward Observatory and the Crimean Astrophysical 
Observatory. Optical flux density points were obtained from photometry at 
these two sites plus Lowell Observatory, Perugia University Astronomical 
Observatory and the Abastumani Astrophysical Observatory. All of the 
optical telescopes are equipped with charge-coupled-device cameras. 
Measurements of X-ray flux and the continuum spectrum were obtained 
from a monitoring program with the NASA Rossi X-ray Timing Explorer. 
Measurements of radio flux density were obtained from the University of 
Michigan Radio Astronomy Observatory and the Metsahovi Radio 
Observatory. Descriptions of telescopes and data analysis are available in the 
Supplementary Information. 
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sequence of images in Fig. 1 shows a bright, superluminal knot that 
first appears upstream of the core. It subsequently moves past the 
core and proceeds down the jet at a position angle of ~190° and with 
an EVPA that is parallel to the jet to within the observational uncer- 
tainty. The close correspondence between the optical and 7-mm 
EVPAs after 29 October implies that the knot is the emitter of the 
polarized optical emission during the flare. 

Previous authors’*'® have suggested that rotations of the polariza- 
tion vector occur in BL Lac and the similar active galactic nucleus 
OJ 287. Their observations, which were more poorly sampled than 
ours, allowed multiple interpretations owing to the + 180° ambiguity 
of the EVPA. Despite this, the model that we advocate is quite similar 
to one of those proposed previously’, with the location of the 
emission region and connection with high-energy flares now spe- 
cified by our sequences of Very Long Baseline Array images and 
multiwaveband light curves. 

We interpret the event in the following manner (see Fig. 3). 
Explosive activity at the inlet of the jet near the black hole injects a 
surge of energy into the jet across part of its cross-sectional area. This 
disturbance appears as a knot of emission as it propagates along a 
subset of streamlines through the acceleration and collimation zone. 
Doppler beaming of radiation emitted by high-energy electrons in 
the disturbance increases as the knot accelerates along its spiral path, 
which stretches out with distance down the jet. These effects cause the 
flux of synchrotron radiation from the knot to rise until it dominates 
the optical, X-ray and (through inverse Compton scattering) y-ray 
emission from BL Lac as the disturbance exits the zone of helical 
magnetic field. Maximum beaming—and therefore the peak in the 
light curve of the first flare—occurs during the last spiral, when the 
Lorentz factor of the jet is near its asymptotic value and the velocity 
vector of the knot points most closely towards our line of sight. The 
peak can be quite sharp’, as observed. At the point when the flare 
dominates the optical flux, we see the optical polarization vector 
rotate before the shock exits the acceleration and collimation zone. 
This zone is opaque at radio wavelengths, owing to synchrotron self- 
absorption; hence, the first flare is absent in the radio light curves. 

Beyond the acceleration and collimation zone, the disturbance 
forms a moving shock wave that encounters a region of turbulence, 
which is possibly driven by velocity shear across the jet® downstream 
of the point at which the magnetic and particle energy densities reach 
rough equipartition*. The ambient magnetic field in the jet has a 
chaotic structure in this region. Because the shock front amplifies 
only the component of the field that is parallel to the front, the 
EVPA becomes transverse to this direction and therefore essentially 


Helical magnetic field 


Moving emission feature 


TO 


Acceleration and collimation zone---~~ 


Conical standing shock 


aS 


Millimetre-wave core 


Turbulent plasma 


; a Accretion disk 


0 10Rs 10°Re 10°Re 104R, 


Distance from black hole 


10°Re 


Figure 3 | Proposed model for the inner jet of BL Lac. A shock propagates 
down the jet along a spiral streamline. The first flare occurs during the last 
240° twist of the streamline before the flow straightens and becomes 
turbulent. The passage of the feature through the millimetre-wave core 
stimulates the second flare. A logarithmic scale of distance from the black 
hole, shown in terms of the Schwarzschild radius (Rs), is used to illustrate 
phenomena on various scales. 
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parallel to the velocity vector of the knot at a position angle of ~190°. 
During this phase, the flux lessens as the knot continues down the 
broadening jet, where the magnetic field strength and electron den- 
sity decrease. 

According to the model we propose here, the variation of EVPA 
with time should deviate from a strict linear dependence owing to 
projection effects, because the circular cross-section has an elliptical 
shape from our viewpoint. We have calculated this effect, including 
relativistic aberration, and show in Fig. 2g that the optical EVPA data 
do follow the predicted curve. The small number of brief excursions 
of the EVPA from the curve, the deviations from the mean EVPA 
before and after the rotation, and irregularities in the light curves can 
all be explained by local flare-ups of emission that briefly amplify 
both the polarization along a particular direction and the flux at 
various wavebands. 

The smoothness of the plot of EVPA versus time eliminates 
the possibility’®’” that the rotation is actually a random walk of the 
polarization vector due to a chaotic magnetic field. If this were the 
case, our numerical simulations (see ref. 17) indicate that the curve 
would be much more jagged than is observed when the degree of 
polarization is ~5%. In the simulations, this level of polarization 
corresponds to synchrotron emission from ~200 independent cells, 
each with a randomly oriented magnetic field. Apparent rotations by 
~240° are very rare in such simulations, whereas they are common 
during flares of BL Lac and similar objects". 

Both synchrotron radiation and inverse Compton scattering con- 
tribute to the X-ray emission from BL Lac, with synchrotron radi- 
ation dominating when electrons are accelerated to energies in the 
TeV range’*'’. This generally causes the plot of flux density (F,) 
versus frequency (v) to steepen in the X-ray range in such a way that 
the spectral index « is greater than one, where F, x v *. Such X-ray 
spectral steepening occurs during the first flare. In contrast, the X-ray 
spectrum becomes harder (2<1) during the second flare, as is 
expected if the X-rays are mostly generated by inverse Compton 
scattering of optical and infrared photons. 

The second flare, which started at 2005.89, is simultaneous with 
the passage of the knot through the core seen on the Very Long 
Baseline Array images. If the core were a standing conical shock, as 
has been determined from simultaneous radio and optical polariza- 
tion variability in the case of the quasar PKS 0420—014 (ref. 18), the 
emission would increase as the knot undergoes compression by the 
shock front. The flare dies down at optical and X-ray frequencies as 
the knot propagates away from the core down the expanding jet. 
However, it lasts much longer at 43 GHz, at which frequency the 
synchrotron radiation requires lower-energy electrons that have 
longer energetic lifetimes than those emitting at higher frequencies. 

In the Supplementary Information we relate the angular velocity of 
the feature, inferred from the rotation of the optical polarization 
vector, to the rotational velocity of the base of the magnetic field near 
the black hole. We find that the rotational velocity thus obtained is 
consistent with the predictions of models in which the jet is driven by 
twisting magnetic fields from either the accretion disk’**° or the 
ergosphere of the black hole**”°”". 

The combination of densely sampled sub-milliarcsecond imaging 
using the Very Long Baseline Array, polarimetry, and multiwaveband 
flux measurements has allowed us to explore the inner jet of BL Lac. 
Future data from more sensitive y-ray Cherenkov detectors and the 
NASA Gamma-ray Large Area Space Telescope will soon allow more 
refined analyses. The ability to detect emission upstream of the core 
at 7-mm wavelength indicates that increasing the resolution of very- 
long-baseline interferometry by adding antennas in high Earth orbits 
will provide more detailed direct imaging of the inner jets of active 
galactic nuclei. 
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A topological Dirac insulator in a quantum spin Hall 


phase 


D. Hsieh’, D. Qian’, L. Wray’, Y. Xia’, Y. S. Hor’, R. J. Cava? & M. Z. Hasan’? 


When electrons are subject to a large external magnetic field, the 
conventional charge quantum Hall effect’? dictates that an elec- 
tronic excitation gap is generated in the sample bulk, but metallic 
conduction is permitted at the boundary. Recent theoretical 
models suggest that certain bulk insulators with large spin-orbit 
interactions may also naturally support conducting topological 
boundary states in the quantum limit*~, which opens up the pos- 
sibility for studying unusual quantum Hall-like phenomena in 
zero external magnetic fields®. Bulk Bi,_,Sb, single crystals are 
predicted to be prime candidates”* for one such unusual Hall 
phase of matter known as the topological insulator”"’. The hall- 
mark of a topological insulator is the existence of metallic surface 
states that are higher-dimensional analogues of the edge states that 
characterize a quantum spin Hall insulator*’. In addition to its 
interesting boundary states, the bulk of Bi,_,Sb, is predicted to 
exhibit three-dimensional Dirac particles'*’, another topic of 
heightened current interest following the new findings in two- 
dimensional graphene'*” and charge quantum Hall fractionali- 
zation observed in pure bismuth’'. However, despite numerous 
transport and magnetic measurements on the Bi,_,Sb, family 
since the 1960s’, no direct evidence of either topological Hall 
states or bulk Dirac particles has been found. Here, using 
incident-photon-energy-modulated angle-resolved photoemis- 
sion spectroscopy (IPEM-ARPES), we report the direct obser- 
vation of massive Dirac particles in the bulk of Bip,.9Sbo.;, locate 
the Kramers points at the sample’s boundary and provide a com- 
prehensive mapping of the Dirac insulator’s gapless surface 
electron bands. These findings taken together suggest that the 
observed surface state on the boundary of the bulk insulator is a 
realization of the ‘topological metal’”""'. They also suggest that this 
material has potential application in developing next-generation 
quantum computing devices that may incorporate ‘light-like’ bulk 
carriers and spin-textured surface currents. 

Bismuth is a semimetal with strong spin—orbit interactions. Its 
band structure is believed to feature an indirect negative gap between 
the valence band maximum at the T-point of the bulk Brillouin zone 
and the conduction band minima at three equivalent L-points'”” 
(here we generally refer to these as a single point, L). The valence 
and conduction bands at L are derived from antisymmetric (L,) 
and symmetric (L,) p-type orbitals, respectively, and the effective 
low-energy hamiltonian at this point is described by the (3+1)- 
dimensional relativistic Dirac equation'*'°. The resulting dispersion 
relation, E(k) = + (v-k)* + A?=v-k, is highly linear owing to the 
combination of an unusually large band velocity v and a small gap 4 
(such that |A/|v|| ~ 5 X 107° A~!) and has been used to explain vari- 
ous peculiar properties of bismuth’*”’*. Substituting bismuth with 
antimony is believed to change the critical energies of the band struc- 
ture as follows (see Fig. 1). At an Sb concentration of x ~ 4%, the gap 


A between L, and L, closes and a massless, three-dimensional (3D) 
Dirac point is realized. As xis further increased this gap re-opens with 
inverted symmetry ordering, which leads to a change in sign of A at 
each of the three equivalent L-points in the Brillouin zone. For con- 
centrations greater than x~ 7% there is no overlap between the 
valence band at T and the conduction band at L, and the material 
becomes an inverted-band insulator. Once the band at T drops below 
the valence band at L, at x ~ 8%, the system evolves into a direct-gap 
insulator whose low-energy physics is dominated by the spin—orbit- 
coupled Dirac particles at L”"”. 

Recently, semiconductors with inverted bandgaps have been pro- 
posed to manifest the two-dimensional (2D) quantum spin Hall 
phase, which is predicted to be characterized by the presence of 
metallic one-dimensional edge states*>!*. Although a band-inver- 
sion mechanism and edge states have been invoked to interpret the 
transport results in 2D mercury telluride semiconductor quantum 
wells'’, no one-dimensional edge states are directly imaged, so their 
topological character is yet to be uniquely determined. Recent theor- 
etical treatments”* have focused on the strongly spin—orbit-coupled, 
band-inverted Bi, —,Sb, series as a possible 3D bulk realization of the 
quantum spin Hall phase in which the one-dimensional edge states 
are expected to take the form of 2D surface states’ that may be 
directly imaged and spectroscopically studied, making it feasible to 
identify their topological order parameter character. 

High-momentum-resolution angle-resolved photoemission spec- 
troscopy performed with varying incident photon energy (IPEM- 
ARPES) allows for the measurement of electronic band dispersion 
along various momentum space (k-space) trajectories in the 3D bulk 
Brillouin zone. ARPES spectra taken along two orthogonal cuts 
through the L-point of the bulk Brillouin zone of Big 9Sbo,; are shown 
in Fig. la, c. A A-shaped dispersion whose tip lies less than 50 meV 
below the Fermi energy Ep can be seen along both directions. 
Additional features originating from surface states that do not dis- 
perse with incident photon energy are also seen. Owing to the finite 
intensity between the bulk and surface states, the exact binding 
energy Ex where the tip of the A-shaped dispersion lies is unresolved. 
The linearity of the bulk A-shaped bands is observed by locating the 
peak positions at higher EF in the momentum distribution curves and 
the energy at which these peaks merge is obtained by extrapolating 
linear fits to the momentum distribution curves. Therefore, 50 meV 
represents a lower bound on the energy gap 4 between L, and L,. The 
magnitudes of the extracted band velocities along the k, and k, direc- 
tions are 7.9+0.5 X 10*ms! and 10.0 + 0.5 X 10°ms_', respec- 
tively, which are similar to the tight binding values of 7.6 X 10*'ms ! 
and 9.1 X 10°ms_ ' calculated for the L, band of bismuth”. Our data 
are consistent with the extremely small effective mass of 0.002m, 
(where m, is the electron mass) observed in magneto-reflection mea- 
surements on samples with x= 11% (ref. 23). The Dirac point in 
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graphene, coincidentally, has a band velocity (|vp| ~10°ms_')'*® 
comparable to what we observe for Big.9Sbo,;, but its spin—orbit 
coupling is several orders of magnitude weaker, and the only known 
method of inducing a gap in the Dirac spectrum of graphene is by 
coupling to an external chemical substrate”’. The Bi, Sb, series thus 
provides a rare opportunity to study relativistic Dirac hamiltonian 
physics in a 3D condensed matter system where the intrinsic (rest) 
mass gap can easily be tuned. 

Studying the band dispersion perpendicular to the sample surface 
provides a way to differentiate bulk states from surface states in a 3D 
material. To visualize the near- Ep dispersion along the 3D L—X cut (X 
is a point that is displaced from L by a k,-distance of 3m/c, where c is 
the lattice constant), in Fig. 2a we plot energy distribution curves 
(EDCs) taken such that electrons at Ep have fixed in-plane 
momentum (k,, ky) = (Ly Ly) = (0. 8A” "0. oA” '), as a function of 
photon energy hv. There are a prominent features in the EDCs: a 
non-dispersing, k,-independent peak centred just below Ep, at about 
—0.02 eV; a broad, non-dispersing hump centred near —0.3 eV; and 


k = (0.8, k,, 2.9) = 


L+ dk, 


L-0.1 
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a strongly dispersing hump that coincides with the latter near 
hy = 29eV. To understand which bands these features originate 
from, we show ARPES intensity maps along an in-plane cut KMK 
(parallel to the k,-direction) taken using hv values of 22 eV, 29 ev and 
35 eV, which correspond to approximate k, values of L, — 0.3 eae 
and L,+0.3A~', respectively (Fig. 2b). At hy =29eV, the low- 
energy ARPES spectral weight reveals a clear A-shaped band close 
to Ep. As the photon energy is either increased or decreased from 
29 eV, this intensity shifts to higher binding energies as the spectral 
weight evolves from the A-shaped band into a ‘U’-shaped band. 
Therefore, the dispersive peak in Fig. 2a comes from the bulk valence 
band, and for hv = 29 eV the high-symmetry point L = (0.8, 0, 2.9) 
appears in the third bulk Brillouin zone. In the maps of Fig. 2b with 
respective hv values of 22 eV and 35 eV, overall weak features near E; 
that vary in intensity remain even as the bulk valence band moves far 
below Ey. The survival of these weak features over a large photon 
energy range (17—55eV) supports their surface origin. The non- 
dispersing feature centred near —0.3 eV in Fig. 2a comes from the 
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Figure 1| Dirac-like dispersion near the L-point in the bulk Brillouin zone. 
Selected ARPES intensity maps of Bip. 9Sbo,; are shown along three 

k-space cuts through the L-point of the bulk 3D Brillouin zone. The 
presented data are taken in the third Brillouin zone with L, = 2.9 A~' witha 
photon energy of 29 eV. The cuts are along the k,-direction (a); a direction 
rotated by approximately 10° from the k,-direction (b); and the k,-direction 
(c). Here, 6 symbolizes a change along a particular k-direction. Each cut 
shows a A-shaped bulk band whose tip lies below the Fermi level, signalling a 
bulk gap. The surface states are denoted SS and are all identified in Fig. 2 (for 
further identification via theoretical calculations, see Supplementary 
Information). d, Momentum distribution curves corresponding to the 


intensity map in a. f, A log-scale plot of the momentum distribution curves 
corresponding to the intensity map in c. The red lines are guides to the eye 
for the bulk features in the momentum distribution curves. e, Schematic of 
the bulk 3D Brillouin zone of Bi, —,Sb, and the 2D Brillouin zone of the 
projected (111) surface. The high-symmetry points I’, M and K of the surface 
Brillouin zone are labelled. The schematic evolution of bulk band energies as 
a function of x is shown. The L-band inversion transition occurs at x ~ 0.04, 
where a 3D gapless Dirac point is realized, and the composition we study 
here (for which x = 0.1) is indicated by the green arrow. A more detailed 
phase diagram based on our experiments is shown in Fig. 3c. 
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higher-binding-energy (valence band) part of the full spectrum of 
surface states, and the weak non-dispersing peak at —0.02 eV reflects 
the low-energy part of the surface states that cross Ep away from the 
point M and forms the surface Fermi surface (Fig. 2c). 

Having established the existence of an energy gap in the bulk state 
of Bip.oSbo., (Figs 1 and 2) and observed linearly dispersive bulk 
bands uniquely consistent with strong spin-orbit coupling model 
calculations'*"'®** (see Supplementary Information for full com- 
parison with the theoretical calculation), we now discuss the topo- 
logical character of its surface states, which we found to be gapless 
(Fig. 2c). In general, the states at the surface of spin—orbit-coupled 
compounds are allowed to be spin-split owing to the loss of space 
inversion symmetry (E(k,}) = E(—k,})). However, as required by 
Kramers’ theorem, this splitting must go to zero at the four time- 
reversal-invariant momenta in the 2D surface Brillouin zone. As 
discussed in refs 7 and 9, along a path connecting two time- 
reversal-invariant momenta in the same Brillouin zone, the Fermi 
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Figure 2 | Dispersion along the cut in the k,-direction. Surface states are 
experimentally identified by studying their out-of-plane momentum 
dispersion through the systematic variation of incident photon energy. 

a, EDCs of Big 9Sbo,; with electrons at the Fermi level (Ez) maintained at a 
fixed in-plane momentum of (k, = 0.8 All, k, = 0.0 A_!) are obtained as a 
function of incident photon energy to identify states that exhibit no 
dispersion perpendicular to the (111) plane along the direction shown by the 
double-headed arrow labelled ‘3’ in the inset (see Methods for the detailed 
procedure). Selected EDC data sets with photon energies ranging from 28 eV 
to 32 eV in steps of 0.5 eV are shown for clarity. The non-energy-dispersive 
(k,-independent) peaks near Ey are the surface states. b, ARPES intensity 
maps along cuts parallel to the k,-direction taken with electrons at Ey fixed at 
k, =0.8A | with respective photon energies of hy = 22 eV, hv = 29 eV and 
hy = 35 eV (for a conversion map from photon energy to k,, see 
Supplementary Information). The faint A-shaped band at hv = 22 eV and 
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energy inside the bulk gap will intersect these singly degenerate sur- 
face states either an even or an odd number of times. When there are 
an even number of surface state crossings, the surface states are topo- 
logically trivial because weak disorder (as may arise through alloying) 
or correlations can remove pairs of such crossings by pushing the 
surface bands entirely above or below Ep. When there are an odd 
number of crossings, however, at least one surface state must remain 
gapless, which makes it non-trivial”. The existence of such topo- 
logically non-trivial surface states can be theoretically predicted on 
the basis of the bulk band structure only, using the Z,-invariant that 
is related to the quantum Hall Chern number™. Materials with band 
structures with Z, = +1 are ordinary Bloch band insulators that are 
topologically equivalent to the filled-shell atomic insulator, and are 
predicted to exhibit an even number (including zero) of surface state 
crossings. Materials with bulk band structures with Z, = —1, on the 
other hand, which are expected to exist in rare systems with strong 
spin-orbit coupling acting as an internal quantizing magnetic field 
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hy = 35 eV show some overlap with the bulk valence band at L (hv = 29 eV), 
suggesting that it is a resonant surface state degenerate with the bulk state in 
some limited range of momenta near Ef. The flat band of intensity centred 
about —2 eV in the hy = 22 eV scan originates from Bi 5d core-level emission 
from second-order light. c, Projection of the bulk Brillouin zone (black lines) 
onto the (111) surface Brillouin zone (green lines). The overlay (enlarged in 
inset) shows the high-resolution Fermi surface of the metallic surface state 
mode, which was obtained by integrating the ARPES intensity (taken with 
hy = 20 eV) from —15 meV to 10 meV with respect to Ep. The six teardrop- 
shaped lobes of the Fermi surface close to IT’ (the centre of the Brillouin zone) 
show some intensity variation between them that is due to the relative 
orientations between the axes of the lobes and the axis of the detector slit. The 
six-fold symmetry was however confirmed by rotating the sample in the k,-k, 
plane. EDCs corresponding to the cuts A, B and C are also shown; these 
confirm the gapless character of the surface states in bulk insulating Bip 9Sbo 1. 
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on the electron system® and inverted bands at an odd number of high- 
symmetry points in their bulk 3D Brillouin zones, are predicted to 
exhibit an odd number of surface state crossings, precluding their 
adiabatic continuation to the atomic insulator*’"*. Such ‘topological 
metals’ cannot be realized in a purely 2D electron gas system. 

In our experimental case, namely the (111) surface of Bip 9Sbo,;, the 
four time-reversal-invariant momenta are located at I and three M- 
points that are rotated by 60° relative to one another. Owing to the 
three-fold crystal symmetry (A7 bulk structure) and the observed 
mirror symmetry of the surface Fermi surface across k,.= 0 (Fig. 2), 
these three M-points are equivalent (and we henceforth refer to them 
as a single point, M). The mirror symmetry (E( k,) = E(—k,)) is also 
expected, from time-reversal invariance exhibited by the system. The 
complete details of the surface state dispersion observed in our 
experiments along a path connecting I’ and M are shown in Fig. 3a; 
finding this information is made possible by our experimental sepa- 
ration of surface states from bulk states. As for bismuth, two surface 
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bands emerge from the bulk band continuum near I to form a 
central electron pocket and an adjacent hole lobe**”’. It has been 
established that these two bands result from the spin-splitting of a 
surface state and are thus singly degenerate”””*. 

On the other hand, the surface band that crosses E, at 
—k,=0.5 A and forms the narrow electron pocket around M, is 
clearly doubly degenerate, as far as we can determine within our 
experimental resolution. This is indicated by its splitting below Ep 
between —k,~ 0.55 A”! and M, as well as the fact that this splitting 
goes to zero at M in accordance with Kramers’ theorem. In semime- 
tallic single-crystal bismuth, only a single surface band is observed to 
form the electron pocket around M (refs 29 and 30). Moreover, this 
surface state overlaps, and hence becomes degenerate with, the bulk 
conduction band at L (L projects to the surface point M) owing to the 
semimetallic character of bismuth (Fig. 3b). In Big 9Sbo.;, on the other 
hand, the states near M fall completely inside the bulk energy gap, 
preserving their purely surface character at M (Fig. 3a). The surface 
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Figure 3 | The topological gapless surface states in bulk insulating 
Bip.9Sbo. a, The surface-band-dispersion second-derivative image of 

Big 9Sbo,, along ! —M. The shaded white area shows the projection of the 
bulk bands based on ARPES data, as well as a rigid shift of the tight binding 
bands to sketch the unoccupied bands above the Fermi level. A non-intrinsic 
flat band of intensity near E; generated by analysis of the second-derivative 
image was rejected to isolate the intrinsic dispersion. The Fermi crossings of 
the surface state are denoted by yellow circles, with the band near 

—k,~ 0.5 A~! counted twice owing to double degeneracy. The red lines are 
guides to the eye. An in-plane rotation of the sample by 60° produced the 
same surface state dispersion. The EDCs along I — M are shown in the right- 
hand diagram. There are a total of five crossings between I’ and M, which 
indicates that these surface states are topologically non-trivial. The number 
of surface state crossings in a material (with an odd number of Dirac points) 
is related to the topological Z, invariant (see text). b, The resistivity curves of 
Bi and Big 9Sbo,; reflect the contrasting transport behaviours. The presented 
resistivity curve for pure bismuth has been multiplied by a factor of 80 for 
clarity. c, Schematic variation of bulk band energies of Bi,_,Sb, as a 


function of x (based on band calculations and on refs 7 and 17). Big. gSbo,; is a 
direct-gap bulk Dirac point insulator well inside the inverted-band regime, 
and its surface forms a ‘topological metal’—the 2D analogue of the one- 
dimensional edge states in quantum spin Hall systems. d, ARPES intensity 
integrated within +10 meV of E; originating solely from the surface state 
crossings. The image was plotted by stacking along the negative k,-direction 
a series of scans taken parallel to the k,-direction. e, Outline of the Big Sb, 
surface state ARPES intensity near E; measured in d. White lines show the 
scan directions ‘1’ and ‘2’. f, Surface band dispersion along direction ‘1’ taken 
with hv = 28 eV, and the corresponding EDCs (g). The surface Kramers 
degenerate point, critical in determining the topological Z, class of a band 
insulator, is clearly seen at M, approximately 15 + 5 meV below Eg. (We note 
that the scans are taken along the negative k,-direction, away from the bulk 
L-point.) h, Surface band dispersion along direction ‘2’ taken with 

hy = 28 eV, and the corresponding EDCs (i). This scan no longer passes 
through the M-point, and the observation of two well-separated bands 
indicates the absence of Kramers degeneracy as expected, which 
corroborates the result in a. 
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Kramers doublet point can thus be defined in the bulk insulator 
(unlike in bismuth***°) and is experimentally located in Big 9Sbo.1 
samples to lie approximately 15 + 5 meV below Eg at k=M (Fig. 3a). 
For the precise location of this Kramers point, it is important to 
demonstrate that our alignment is strictly along the I-M line. To 
do so, we contrast high-resolution ARPES measurements taken along 
the IM line with those that are slightly offset from it (Fig. 3e). 
Figures 3f1 show that with k, offset from the Kramers point at M 
by less than 0.02 A~', the degeneracy is lifted and only one band 
crosses E, to form part of the bow-shaped electron distribution 
(Fig. 3d). Our finding of five surface state crossings (an odd rather 
than an even number) between I’ and M (Fig. 3a), confirmed by our 
observation of the Kramers degenerate point at the time-reversal- 
invariant momentum, indicates that these gapless surface states are 
topologically non-trivial. This corroborates our bulk electronic 
structure result that Bip. Sbo, is in the insulating band-inverted 
(Z, = —1) regime (Fig. 3c), which contains an odd number of bulk 
(gapped) Dirac points, and is topologically analogous to an integer- 
quantum-spin Hall insulator. 

Our experimental results taken collectively strongly suggest that 
Big.9Sbo,; is quite distinct from graphene'®*”” and represents a novel 
state of quantum matter: a strongly spin—orbit-coupled insulator 
with an odd number of Dirac points and a negative Z, topological 
Hall phase. Our work points to future possibilities for further spec- 
troscopic investigations of topological orders in quantum matter. 


METHODS SUMMARY 


High-resolution IPEM-ARPES data have been taken at beamlines 12.0.1 and 
10.0.1 of the Advanced Light Source at the Lawrence Berkeley National 
Laboratory, as well as at the PGM beamline of the Synchrotron Radiation 
Center in Wisconsin, with photon energies ranging from 17 eV to 55eV and 
energy resolutions ranging from 9 meV to 40 meV, and momentum resolution 
better than 1.5% of the surface Brillouin zone. Data were taken on high-quality 
bulk single-crystal Bi, Sb, at a temperature of 15 K and chamber pressures less 
than 8 X 10° '' torr. Throughout this paper, the bulk bands presented are from 
those measured in the third bulk Brillouin zone, to ensure a high degree of signal- 
to-noise contrast, and the k, values are estimated using the standard free-electron 
final-state approximation. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 25 November 2007; accepted 14 February 2008. 


1. von Klitzing, K., Dorda, G. & Pepper, M. New method for high-accuracy 
determination of the fine-structure constant based on quantized Hall resistance. 
Phys. Rev. Lett. 45, 494-497 (1980). 

2. Tsui, D. C., Stormer, H. L. & Gossard, A. C. Two-dimensional magnetotransport in 
the extreme quantum limit. Phys. Rev. Lett. 48, 1559-1562 (1982). 

3. Kane, C. L. & Mele, E. J. Quantum spin Hall effect in graphene. Phys. Rev. Lett. 95, 
226801 (2005). 

4. Bernevig, B. A. & Zhang, S.-C. Quantum spin Hall effect. Phys. Rev. Lett. 96, 
106802 (2006). 

5. Sheng, D.N., Weng, Z. Y., Sheng, L. & Haldane, F. D. M. Quantum spin-Hall effect 
and topological Chern numbers. Phys. Rev. Lett. 97, 036808 (2006). 

6. Haldane, F. D. M. Model for a quantum Hall effect without Landau levels: 
Condensed-matter realization of the ‘parity anomaly’. Phys. Rev. Lett. 61, 
2015-2018 (1988). 


974 


NATURE| Vol 452|24 April 2008 


7. Fu,L.& Kane, C. L. Topological insulators with inversion symmetry. Phys. Rev. B 76, 
045302 (2007). 

8. Murakami, S. Phase transition between the quantum spin Hall and insulator 
phases in 3D: emergence of a topological gapless phase. New J. Phys. 9, 356 
(2007). 

9. Fu,L., Kane, C. L. & Mele, E. J. Topological insulators in three dimensions. Phys. 

Rey. Lett. 98, 106803 (2007). 

0. Moore, J. E. & Balents, L. Topological invariants of time-reversal-invariant band 
structures. Phys. Rev. B 75, 121306(R) (2007). 

1. Roy, R. Three dimensional topological invariants for time reversal invariant 
Hamiltonians and the three dimensional quantum spin Hall effect. (http:// 
arXiv.org/abs/cond-mat/0607531v1) (2006). 

2. Bernevig, B. A., Hughes, T. L. & Zhang, S.-C. Quantum spin Hall effect and 
topological phase transition in HgTe quantum wells. Science 314, 1757-1761 
(2006). 

3. Konig, M. et al. Quantum spin Hall insulator state in HgTe quantum wells. Science 

318, 766-770 (2007). 

4. Wolff, P. A. Matrix elements and selection rules for the two-band model of 

bismuth. J. Phys. Chem. Solids 25, 1057-1068 (1964). 

5. Fukuyama, H. & Kubo, R. Interband effects of magnetic susceptibility. II. 

Diamagnetism of bismuth. J. Phys. Soc. Jpn 28, 570-581 (1970). 

6. Buot, F. A. Weyl transformation and the magnetic susceptibility of a relativistic 

Dirac electron gas. Phys. Rev. A 8, 1570-1581 (1973). 

7. Lenoir, B. et al. in Fifteenth International Conference on Thermoelectrics (Pasadena, 
California) 1-13 (IEEE, New York, 1996). 

8. Zhang, Y., Tan, Y.-W., Stormer, H. L. & Kim, P. Experimental observation of the 

quantum Hall effect and Berry's phase in graphene. Nature 438, 201-204 (2005). 

9. Novoselov, K. S. et al. Room-temperature quantum Hall effect in graphene. Science 

315, 1379 (2007). 

20. Zhou, S. Y. et al. Substrate-induced bandgap opening in epitaxial graphene. Nature 

Mater. 6, 770-775 (2007). 

21. Behnia, K., Balicas, L. & Kopelevich, Y. Signatures of electron fractionalization in 

ultraquantum bismuth. Science 317, 1729-1731 (2007). 

22. Liu, Y. & Allen, E. Electronic structure of semimetals Bi and Sb. Phys. Rev. B 52, 

566-1577 (1995). 

23. Hebel, L. C. & Smith, G. E. Interband transitions and band structure of a BiSb alloy. 

Phys. Lett. 10, 273-275 (1964). 

24. Kane, C. L. & Mele, E. J. Z2 topological order and the quantum spin Hall effect. 

Phys. Rev. Lett. 95, 146802 (2005). 

25. Ast, C. R. & Hochst, H. Fermi surface of Bi(111) measured by photoemission 

spectroscopy. Phys. Rev. Lett. 87, 177602 (2001). 

26. Hochst, H. & Gorovikov, S. Lack of electron-phonon coupling along two- 

dimensional bands in Bi, — ,Sb, single crystal alloys. J. Elect. Spectrosc. Relat. 

Phenom. 144, 351-355 (2005). 

27. Hofmann, P. The surfaces of bismuth: Structural and electronic properties. Prog. 

Surf. Sci. 81, 191-245 (2006). 

28. Hirahara, T. et al. Direct observation of spin splitting in bismuth surface states. 

Phys. Rev. B 76, 153305 (2007). 

29. Hengsberger, M. et al. Photoemission study of the carrier bands in Bi(111). Eur. 

Phys. J. B 17, 603-608 (2000). 

30. Ast, C. R. & Hochst, H. Electronic structure of a bismuth bilayer. Phys. Rev. B 67, 

13102 (2003). 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 


Acknowledgements We thank P. W. Anderson, B. A. Bernevig, L. Balents, 

E. Demler, A. Fedorov, F. D. M. Haldane, D. A. Huse, C. L. Kane, R. B. Laughlin, 

J. E. Moore, N. P. Ong, A. N. Pasupathy, D. C. Tsui and S.-C. Zhang for discussions. 
The synchrotron experiments are supported by the DOE-BES and materials 
synthesis is supported by the NSF-MRSEC at Princeton Center for Complex 
Materials. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. Correspondence and requests for materials should be 
addressed to M.Z.H. (mzhasan@princeton.edu). 


©2008 Nature Publishing Group 


doi:10.1038/nature06843 


METHODS 

Growth method for high-quality single crystals. The Bi, —,Sb, single-crystal 
samples (0 = x<0.17) used for ARPES experiments were each cleaved from a 
boule grown from a stoichiometric mixture of high-purity elements. The boule 
was cooled from 650 °C to 270°C over a period of five days and was annealed 
for seven days at 270 °C. The samples naturally cleaved along the (111) plane, 
which resulted in shiny flat silver surfaces. X-ray diffraction measurements 
were used to check that the samples were single phase, and confirmed that 
the Bip.9Sbo.; single crystals presented in this paper have a rhombohedral A7 
crystal structure (point group R3m), with room-temperature (300K) lattice 
parameters a= 4.51 Aand c= 11.78 A indexed using a rhombohedral unit cell. 
The X-ray diffraction patterns of the cleaved crystals exhibit only the (333), 
(666), and (999) peaks, showing that the cleaved surface is oriented along the 
trigonal (111) axis. Room-temperature data were recorded on a Bruker D8& 
diffractometer using Cu Ko radiation (A = 1.54 A) and a diffracted-beam 
monochromator. The in-plane crystal orientation was determined by Laue 
X-ray diffraction. During the angle-resolved photoemission spectroscopy 
(ARPES) measurements a fine alignment was achieved by carefully studying 
the band dispersions and Fermi surface symmetry as an internal check for 
crystal orientation. 

Transport measurements. Temperature-dependent resistivity measurements 
were carried out on single-crystal samples in a Quantum Design PPMS-9 
instrument, using a standard four-probe technique on approximately 
4X 1X 1-mm®, rectangular samples with the current in the basal plane, which 
was perpendicular to the trigonal axis. The four contacts were made by using 
room-temperature silver paste. The data for samples with concentrations ran- 
ging from x= 0 to x= 0.17 showed a systematic change from semimetallic to 
insulating-like behaviour with increasing x, in agreement with previously pub- 
lished works'*, which was used as a further check of the antimony concentra- 
tions. Conventional magnetic and transport measurements”'”*! such as these 
cannot separately measure the contributions of the surface and bulk states to 
the total signal. ARPES, on the other hand, is a momentum-selective tech- 
nique*’, which allows for a separation of 2D (surface) from 3D (bulk) disper- 
sive energy bands. This capability is especially important for Bi, ,Sb, because 
the Dirac point lies at a single point in the 3D Brillouin zone, unlike for 2D 
graphene, where the Dirac points can be studied at any arbitrary perpendicular 
momentum along a line****. 

Systematic methods for separating bulk from surface electronic states. ARPES 
is a photon-in, electron-out technique**. Photoelectrons ejected from a sample 
by a monochromatic beam of radiation are collected by a detector capable of 
measuring its kinetic energy Exin. By varying the detector angles, 0 and 9, relative 
to the sample surface normal, the momentum of the photoelectrons, K, can also 
be determined (as illustrated in Supplementary Fig. la). By employing the com- 
monly used free-electron final state approximation, we can fully convert from 
the measured kinetic energy and momentum values of the photoelectron to the 
binding energy, Es, and Bloch momentum values k of its initial state inside the 
crystal, via 


nature 
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ky=Ky= jV 2meBkin sin 0 
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1 
k= P V/ 2me( Ekin cos? 0 — Vo) 


where we have set y = 0, Wis the work function, m, is the electron mass and Vo is an 
experimentally determined parameter, which is approximately —10 eV for bis- 
muth**”®, Features in the ARPES spectra originating from bulk initial states (dis- 
persive along the k,-direction) were distinguished from those originating from 
surface initial states (non-dispersive along the k,-direction) by studying their 
dependence on incident photon energy, hv, and converting this to dependence 
on k, via the displayed equations. ARPES data were collected at beamlines 12.0.1 
and 10.0.1 of the Advanced Light Source at the Lawrence Berkeley National 
Laboratory, as well as at the PGM beamline of the Synchrotron Radiation Center 
in Wisconsin, with incident photon energies ranging from 17 eV to 55 eV, energy 
resolutions ranging from 9 meV to 40 meV and momentum resolution better than 
1.5% of the surface Brillouin zone, using Scienta electron analysers. The combina- 
tion of high spatial resolution and high crystalline quality enabled us to probe only 
the highly ordered and cleanest regions of our samples. Single-crystal Bi,—,Sb, 
samples were cleaved in situ at a temperature of 15 K and chamber pressures less 
than 8 X 10 |! torr, and high surface quality was checked throughout the measure- 
ment process by monitoring the EDC linewidths of the surface state. To measure the 
near-Ey dispersion of an electronic band along a direction normal to the sample 
surface, such as the direction from X (2n/\3a, 0, 81/c) to L (2n/ / 3a, 0, 11n/c) shown 
in Fig. 2a, EDCs were taken at several incident photon energies. The kinetic energy of 
the photoelectron at Ey is different for each value of hy, so the angle 0 was first 
adjusted and then held fixed for each hv so as to keep k,, constant at 2n//3a = 0.8 A! 
for electrons emitted near Ep. To ensure that the in-plane momentum remained 
constant at M (the LX line projects onto M) for each EDC, a complete near-E; 
intensity map was generated for each photon energy to precisely locate the M-point 
(see Supplementary Fig. 1d). We note that because the bulk crystal has only three- 
fold rotational symmetry about the k,-axis, the reciprocal lattice does not have 
mirror symmetry about the k, = 0 plane. Therefore, scans taken at +0 and —0 for 
the same photon energy probe different points in the bulk 3D Brillouin zone; this is 
responsible for the absence of the bulk A-shaped band in Fig. 3f. 
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Small phonon contribution to the photoemission kink 
in the copper oxide superconductors 


Feliciano Giustino’”, Marvin L. Cohen’” & Steven G. Louie’ 


Despite over two decades of intense research efforts, the origin of 
high-temperature superconductivity in the copper oxides remains 
elusive. Angle-resolved photoemission spectroscopy experi- 
ments’ have revealed a kink in the dispersion relations (energy 
versus wavevector) of electronic states in the copper oxides at 
binding energies of 50—80 meV, raising the hope that this anom- 
aly could be a key to understanding high-temperature supercon- 
ductivity. The kink is often interpreted in terms of interactions 
between the electrons and a bosonic field. Although there is no 
consensus on the nature of the bosons (or even whether a boson 
model is appropriate), both phonons’ and spin fluctuations” 
have been proposed as possible candidates. Here we report 
first-principles calculations of the role of phonons and the 
electron-phonon interaction in the photoemission spectra of 
La, — ,Sr,CuO,. Our calculations within the standard formalism 
demonstrate that the phonon-induced renormalization of the elec- 
tron energies and the Fermi velocity is almost one order of mag- 
nitude smaller than the effect observed in photoemission 
experiments. Therefore, our result rules out electron-phonon 
interaction in bulk La, _ ,Sr,CuO, as the exclusive origin of the 
measured kink. Our conclusions are consistent with those reached 
independently in a recent study’ of the related compound 
YBazCu3O7. 

Previous first-principles investigations have already suggested a 
rather weak electron-phonon coupling*’ in Cag.27Sro,63CuO2 and 
YBa,Cu;O7. However, because these studies did not directly address 
the signature of the electron-phonon interaction in the photoemis- 
sion spectra, their implications for the interpretation of the kink are 
unclear. In the present study we aim to perform a direct and quanti- 
tative comparison between the measured and the calculated photo- 
emission kinks in a prototypical copper oxide superconductor. We 
studied Laz — ,Sr,CuO,4 (LSCO) because it crystallizes in a simple 
lattice characterized by a single CuO, layer, and the hole concentra- 
tion can be experimentally controlled over the entire phase diagram 
by varying the strontium content®. We considered optimally doped 
(x=0.15) and heavily overdoped (x= 0.30) LSCO, for which a 
mean-field description of the electronic structure such as that given 
by density functional theory is appropriate to interpret measured 
Fermi surfaces’. We did not attempt to address the electron-phonon 
coupling in the underdoped compound because for that case the 
validity of the Fermi liquid approach has been questioned’. In order 
to calculate the electronic structure of LSCO we employed a genera- 
lized gradient approximation to density functional theory”’® (see 
Supplementary Information). The calculated Fermi surfaces of 
LSCO are generally in good agreement with angle-resolved photo- 
emission spectroscopy experiments’! (see Supplementary Fig. 1), 
and in particular they reproduce the evolution of the Fermi surface 
topology from a (0, 0)-centred electron pocket in overdoped LSCO 
to a (, m)-centred hole pocket in the underdoped compound” 


(wavevectors are given in units of 1/a, a= 3.774 being the lattice 
parameter). 

We studied the lattice dynamics of LSCO by calculating the pho- 
non eigenmodes and eigenenergies across the entire Brillouin zone 
using density functional perturbation theory’’. In Fig. 1 we compare 
the calculated phonon dispersions along the antinodal direction 
(0,0)—-(,0) and the total vibrational density of states to inelastic 
neutron scattering data’*’’. The dispersion of the in-plane 
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Figure 1| Phonons of La, _ ,Sr,CuO, at optimal doping (x = 0.15). 

a, Calculated dispersions along the antinodal direction (0, 0)—(27, 0) (solid 
lines), compared with the inelastic neutron scattering data of ref. 14 (circles: 
300 K data; disks: 10 K data). The labels indicate the phonon branches 
associated with the full-breathing (FB) in-plane Cu—O stretching mode at 
(n, 1), the half-breathing (HB) in-plane Cu—O stretching mode at (7, 0), the 
c axis apical (AP) oxygen stretching mode at (0,0), and the buckling/ 
stretching (BS) modes at (1, 0) and (7, 2). b, Calculated phonon density of 
states (solid line), compared with the inelastic neutron scattering data of ref. 
15 (disks). The stepped black lines indicate the correspondence between the 
measured and calculated peaks. The theoretical phonon energies agree with 
experiment to within 5 meV. The shaded regions correspond to the phonon 
branches highlighted in a. The calculated half-breathing (1, 0) and apical 
(0, 0) frequencies respectively correspond to energies of 68 meV and 53 meV, 
whereas the experimental values are 68-70 meV and 57 meV (ref. 14). To 
extend the present comparison to the undoped parent oxide, we also 
performed the calculations for the (x, 0) half-breathing mode in 
antiferromagnetic La,CuOy,. The theoretical energy of 76 meV agrees with 
neutron scattering measurements yielding 74 meV (ref. 16), and indicates 
that our theoretical framework correctly describes the softening of the half- 
breathing phonon mode through the insulator-to-metal transition’. 
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half-breathing Cu—O stretching mode is found to be in very good 
agreement with experiment. Inspection of the vibrational density of 
states shows that a one-to-one correspondence can be established 
between the calculated and measured peaks, indicating an overall 
agreement between theory and experiment’*"”. 

An electron travelling in a solid distorts the lattice because of the 
Coulomb interaction with the ions. The lattice distortion in turn has 
a feedback on the electron dynamics, resulting in an increase of the 
electron mass and a shortening of the electron lifetime in a particular 
quasi-particle state. Within quantum field theory, this effect is 
described in terms of a complex self-energy X that the electron 
acquires as a consequence of the electron—phonon interaction. The 
real part of the self-energy describes the change in the electron energy, 
and the imaginary part provides information on the electron lifetime 
t through t=//(2Im). The electron self-energy is calculated within 
the Migdal approximation to Feynman—Dyson perturbation the- 
ory'**°. Because of the d-wave symmetry of the pseudogap*' and 
the superconducting gap’’, this is a reasonable approximation for 
the (0, 0)—(z, =) nodal direction. By focusing on the nodal direction 
we also avoid the van Hove singularity at (7, 0), and the comparison 
with experiment can be performed with little ambiguity. The diag- 
onal part of the electron self-energy operator ¥ (r,r’,«) with respect 
to the unperturbed electronic states arising from the electron— 
phonon interaction is'**° 


Dk(@) = (nk | X(r,2',@) | nk) 


dk’ ae 
= ZA |smn,v k,k 
S| Fe lem) a 


my 


x 1 =f k’ on Ngy \ Sink! =F Ngy 
. T oc 
End —Ogy— 10 O— Eqs + gy — 10 


where r and r’ correspond to position vectors; w is the frequency 
of the quasi-particle excitation; |nk) denotes a Bloch eigenstate 
with wavevector k, band index n and energy é 3 and 
SnnvCk, k’) = (mk | Agy V(r) |nk) is the electron-phonon matrix 
element for the scattering |nk) >|mk’) through a phonon of wave- 
vector q=k’ —k, branch index v and energy wq,. The operator 
Aqy V(x) represents the variation of the self-consistent potential with 
respect to a collective lattice displacement associated with this pho- 
non, and fy;k’ and ngy, which are respectively the fermion and boson 
occupation numbers, account for the temperature dependence of the 
self-energy. 

The accurate evaluation of the electron self-energy requires the 
calculation of the electron-phonon matrix elements ginn,»(k,k’) on 
an extremely fine mesh consisting of more than 100,000 phonon 
wavevectors. This represents a formidable computational task and 
was previously beyond the reach of existing approaches. To over- 
come this difficulty we developed a new technique that is based on 
the Wannier representation” and exploits the short-range nature of 
the electron-phonon interaction in real space**”’. Figure 2 shows the 
electron self-energy calculated for optimally doped and overdoped 
LSCO at 20K. At optimal doping the real part of the self-energy 
displays a peak at a binding energy of 70 meV, a shoulder at a binding 
energy of 40 meV and a broad background extending up to much 
higher binding energies. A mode-resolved analysis shows that the 
main peak at 70 meV can be assigned to the Cu—O in-plane half- 
breathing motion around (n, 0) and full-breathing motion around 
(1, 1). The shoulder at 40 meV arises from phonons involving com- 
ponents of both out-of-plane buckling of the planar oxygen atoms 
and in-plane O-O stretching vibrations. The broad high-energy 
background can be shown to arise from a density-of-states effect 
(see Fig. 2 legend). As a consequence of the larger hole concentration, 
in the overdoped regime the high-energy background moves closer to 
the Fermi level, causing the main peak to appear slightly blue shifted 
and the shoulder to lose intensity. 
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To clarify the origin of the 70 meV peak and the 40 meV shoulder, 
in Fig. 3 we plot the squared electron-phonon matrix elements 
|e (k,k’) ie each of which represents the transition probability 
from a initial nodal state (corresponding to the combination of the 
Cu dz_, orbitals and the p, orbitals of the planar O atoms) with 
wavevector k on the Fermi surface to a final state with wavevector k’, 
associated with a given phonon with wavevector q=k’—k. Our 
calculations show that the matrix elements corresponding to the 
half-breathing and the full-breathing modes are by far the largest 
among all vibrational modes (with the exception of the apical oxygen 
stretching mode; see Fig. 3 legend), thereby explaining the main 
peak in the self-energy at 70 meV. On the other hand, the 40 meV 
buckling/stretching modes show only moderately large matrix 
elements in comparison with the 70 meV branches. However, these 
modes connect the nodal region to a large portion of the Fermi 
surface, and therefore their contribution to the self-energy is 
enhanced by a significant phase space effect. 
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Figure 2 | Calculated electron self-energy in LSCO due to the 
electron-phonon interaction. a, b, Real parts of the self-energy for optimally 
doped (x = 0.15) and heavily overdoped (x = 0.30) LSCO, respectively, at 
20 K. ¢, d, Imaginary parts of the self-energy for optimally and heavily 
overdoped LSCO, respectively. The inset axes in each panel are the same as 
those in the main plot. In a—d each inset shows the relevant (real or 
imaginary) part of the self-energy on an extended energy scale. We have 
computed the self-energy for electrons with parallel momentum k along the 
cut (0,0, k, )—(, 1, k,.) (see Supplementary Fig. 1b) for three different 
values (0, ma/c, and 2na/c) of the normal component k, . The variation of the 
self-energy along the c axis is negligible along the cuts studied; therefore, we 
only show the results for the cut through the zone centre. The black markers 
in a—d indicate the main peak at ~70 meV and the shoulder at ~40 meV. In 
a the black dashed lines represent the contributions to the self-energy from 
the half-breathing and full-breathing Cu—O stretching modes, and the 
dotted lines represent the contributions to the self-energy from the 
breathing/stretching modes around 40 meV. The sum of these two 
contributions (green area) accounts for more than 80% of the real part of the 
self-energy. The imaginary part of the self-energy in the insets of ¢ and 

d exhibits a step-like behaviour with a leading edge around 40-70 meV. This 
is a consequence of the Pauli exclusion principle: holes with energy below the 
threshold for phonon emission ~40—70 meV cannot make transitions to 
above the Fermi level by emitting a phonon, and thus exhibit longer 
lifetimes. Because the electronic density of states increases with increasing 
binding energy, a hole with large binding energy has an increased probability 
of decaying through phonon emission/absorption, owing to the larger 
number of final states available. The increased decay probability is reflected 
in the imaginary part of the self-energy and gives rise to the broad high- 
energy background in the real part through a Kramers—Kronig relation (see 
Supplementary Discussion and Supplementary Fig. 2). 
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Figure 3 | Electron-phonon matrix elements. Intensity plots of the squared 
electron-phonon matrix elements |g(k, k’)|*, each of which represents the 
transition probability for the scattering from a state with wavevector k on the 
Fermi surface (indicated by the white dots) to a final state with wavevector 
k’ on the Fermi surface, for optimally doped LSCO. a, The matrix elements 
associated with the half-breathing phonons connect adjacent Fermi surface 
arcs with a maximum strength of g* = 7.0 X 10° meV’. b, The full-breathing 
modes connect Fermi arcs on the opposite sides of the Fermi surface with a 
maximum strength of g” = 6.7 X 10° meV’. ¢, The apical oxygen stretching 
mode also shows large matrix elements, of value g* = 6.5 X 10° meV’. 
However, this phonon gives rise only to momentum-conserving transitions 
(that is, with k = k’); hence, the number of allowed final states is negligible 
and its coupling to nodal holes is frustrated. d, The squared matrix element 
for the mixed buckling/stretching modes at a binding energy of 40 meV. 
These modes involve significant momentum transfer along the c axis 

(k’, —k, = 2na/c) and connect the nodal region to a large portion of the 
Fermi surface. The largest matrix elements are found at the intersection of 
the Fermi surface with the (k,, k,, +2na/c) planes. The largest matrix 
element in this case is g” = 4.3 X 10° meV”. The squared matrix elements in 
heavily overdoped LSCO are similar to those obtained at optimal doping 
(within 20%). 
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Figure 4 | Comparison between theory and experiment. a, Comparison 
between the real part of the electron self-energy, as obtained by angle- 
resolved photoemission spectroscopy experiments’, and the first-principles 
theoretical self-energy arising from the electron-phonon interaction. From 
the raw data of Fig. 1a and d of ref. 1 we extracted the self-energy, assuming a 
linear dispersion and using a bare velocity of i|v| = 3.4 eV A obtained by a fit 
to the data using a Holstein electron—boson self-energy model (see 
Supplementary Discussion and Supplementary Fig. 3; the precise value of 
the bare velocity is not critical to our conclusions). We show the 
experimental data at 20 K for optimally doped (x = 0.15) and overdoped 
(x = 0.22) LSCO, as well as the calculated self-energy in the optimally doped 
(x = 0.15) and heavily overdoped (x = 0.30) cases at the same temperature. 
b, c, Comparison between the quasi particle dispersion relations obtained 
from the peaks of the photoemission spectra and our calculations, including 
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We now proceed to a quantitative comparison between our calcu- 
lations and the photoemission data of ref. 1. The measured peaks 
provide the quasi-particle energy E,,, (referred to the Fermi energy) 
for a given parallel momentum k. The real part of the self-energy 
can be extracted from Re (En) = Enk — nk» Where &,, denotes 
the non-interacting energy'®. Along the nodal direction, the non- 
interacting dispersions are linear and we can assume, as is commonly 
done, that é;; =/v-k, where the bare velocity v is to be determined. 
To extract the experimental self-energy we determined the bare 
velocity by fitting the raw data of figure 1a and d of ref. 1 to a simple 
Holstein self-energy model (see Supplementary Information). Using 
the experimentally determined v, we extracted the real part of the self- 
energy, which is shown in Fig. 4. A visual comparison between the 
theoretical self-energy and the experimental data indicates that the 
effect predicted by theory is considerably weaker. We note that this 
result does not depend on the choice of the bare band dispersions. 
When comparing the raw data of ref. 1 with our calculated electron 
dispersions fully renormalized by the electron-phonon interaction 
(Enk = En + Re Xia( En); see Fig. 4b, c), the discrepancy between 
experiment and theory is clearly visible. 

To be quantitative and unambiguous, we extracted the electron— 
phonon coupling strength 2 by measuring the gradients of both the 
theoretical and the experimental self-energy data sets at low energy 
(in the limit of small binding energy the self-energy arising from the 
electron—phonon interaction is Re Y,(E,4) ~ —AE ). We took the 
gradient at the Fermi level as an upper bound on J, and the gradient 
between the Fermi level and a binding energy of 50 meV as a lower 
bound on 4. This procedure yielded the experimentally determined 
average electron—phonon couplings A¢xp¢ = 1.00-1.32 for the optim- 
ally doped sample at 20 K and exp = 0.75-0.99 for the overdoped 
sample at 20 K, consistent with the estimates provided in ref. 1. From 
the calculated self-energy, however, our analysis yielded the theo- 
retical results 24, =0.14-0.22 at optimal doping and 2, = 0.14- 
0.20 in the overdoped regime. Hence, the calculated renormalization 
of the nodal Fermi velocity is Aexpt/Ath = 5-7 times smaller than the 
experimentally determined value. 

Although there are numerous effects beyond the density- 
functional/Migdal formalism that ought to be carefully considered, 
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the renormalization due to the electron—phonon interaction (optimally 
doped (b), overdoped (c)). Following ref. 1, we normalized the 
photoelectron wavevector to one at the binding energy of 170 meV. The 
magnitude of the calculated self-energy is so small that the fine structures 
corresponding to the 40 meV and 70 meV excitations shown in Fig. 2 can 
hardly be seen. The Migdal approximation to the Feynman—Dyson 
perturbation theory (equation (1)) using the density functional theory band 
structure is reasonable for states along the nodal direction in the normal, 
superconducting and pseudogap phases, because the phonons which are 
responsible for the 70 meV peak in Fig. 2a connect the nodal regions mainly 
among themselves (Fig. 3a, b); therefore, the important transitions 
contributing to the self-energy are those between the Fermi surface arcs, 
where the gap function is very small. 


977 


©2008 Nature Publishing Group 


LETTERS 


the large discrepancy between theory and experiment stands out as a 
qualitative effect. Therefore, it is unlikely that such a discrepancy will 
be resolved by removing some of the approximations we adopted in 
our calculations (see Supplementary Information). Importantly, this 
disagreement is also found in the overdoped regime, where the effects 
of electron correlations are reduced. These observations lead us to 
conclude that the electron—phonon interaction in bulk LSCO cannot 
account for the magnitude of the experimentally observed kink. 

Our results are consistent with previous first-principles calcula- 
tions** on the electron-phonon coupling in the copper oxides. 
Brillouin zone-averaged coupling strengths Aaye ~ 0.4 and Aaye = 0.27 
have been obtained using linear-response techniques for 
Cao,27Sr0,63CuO2 and YBayCu3Ov, respectively*’. When we evaluate 
the same parameter using the standard definition, which corresponds 
to the first reciprocal energy moment of the Eliashberg function” 
o°F(), we obtain Jaye = 0.4 for optimally doped LSCO, in agree- 
ment with those studies. In addition, our results are fully consistent 
with very recent theoretical work’ on the related compound 
YBa,Cu3O7, where the renormalization of the nodal Fermi velocity 
was calculated to be 3—5 times smaller than the experimentally deter- 
mined value. 

The analysis of alternative descriptions involving spin excita- 
tions”®, as well as other models in which electron-phonon coupling 
plays a minor role in angle-resolved photoemission spectroscopy”’, is 
beyond the scope of the present work. However, the contribution of 
non-phononic mechanisms to the kink remains an important open 
question. We stress that the present study does not rule out theories 
of high-temperature superconductivity based on novel phonon 
mechanisms and topological approaches, such as the dopant perco- 
lative model”*. 

Recently, a photoemission study of Bi,Sr.CuOg¢ suggested that the 
apparent magnitude of the kink is enhanced by the curvature of the 
bare electronic bands”, and that the electron—phonon interaction 
accounts for only a fraction of the kink. If we were to assume that 
effects not included in our theoretical treatment could moderately 
enhance our calculated self-energy, then the experimental and theo- 
retical results on the phonon contribution could become consistent, 
in line with the recent observation of a conventional isotope effect 
(Z. X. Shen, personal communication). In any event, our calculations 
indicate an electron—phonon coupling considerably weaker than was 
previously suggested'. 
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Eocene/ Oligocene ocean de-acidification linked to 
Antarctic glaciation by sea-level fall 


Agostino Merico’”, Toby Tyrrell’ & Paul A. Wilson! 


One of the most dramatic perturbations to the Earth system dur- 
ing the past 100 million years was the rapid onset of Antarctic 
glaciation near the Eocene/Oligocene epoch boundary’* (~34 
million years ago). This climate transition was accompanied’ by 
a deepening of the calcite compensation depth—the ocean depth 
at which the rate of calcium carbonate input from surface waters 
equals the rate of dissolution. Changes in the global carbon cycle’, 
rather than changes in continental configuration’, have recently 
been proposed as the most likely root cause of Antarctic glaciation, 
but the mechanism linking glaciation to the deepening of calcite 
compensation depth remains unclear. Here we use a global bio- 
geochemical box model to test competing hypotheses put forward 
to explain the Eocene/Oligocene transition. We find that, of the 
candidate hypotheses, only shelf to deep sea carbonate partition- 
ing is capable of explaining the observed changes in both carbon 
isotope composition and calcium carbonate accumulation at the 
sea floor. In our simulations, glacioeustatic sea-level fall associated 
with the growth of Antarctic ice sheets permanently reduces global 
calcium carbonate accumulation on the continental shelves, lead- 
ing to an increase in pelagic burial via permanent deepening of the 
calcite compensation depth. At the same time, fresh limestones are 
exposed to erosion, thus temporarily increasing global river inputs 
of dissolved carbonate and increasing seawater 5'°C. Our work 
sheds new light on the mechanisms linking glaciation and ocean 
acidity change across arguably the most important climate transi- 
tion of the Cenozoic era. 

Evidence for Eocene ice rafting is compelling® but, despite specu- 
lation to the contrary’, high-resolution oxygen isotope records** 
suggest that Cenozoic ice sheets approaching their modern size were 
not initiated on Antarctica until the Eocene/Oligocene boundary. 
Explanations for Antarctic glaciation at the Eocene/Oligocene 
boundary fall into two main categories: those invoking changes in 
ocean circulation through tectonic opening of Southern Ocean gate- 
ways, and those invoking changes in the global carbon cycle, with 
recent studies supporting the latter*’. Proxy records of atmospheric 
CO, over the past 50 million years (Myr)’° suggest an overall shift 
from high levels during the Eocene (~1,000 to ~2,000 parts per 
million by volume, p.p.m.v.) to lower levels thereafter. Support for 
a fall in CO levels at the Eocene/Oligocene boundary comes from 
recent Ocean Drilling Program records’. These data show that dee- 
pening of the calcite compensation depth (CCD) across the Eocene/ 
Oligocene boundary was rapid and took place in two jumps, in lock- 
step with the growth of Antarctic ice-sheets and carbon cycle per- 
turbation—as tracked by changes in benthic foraminiferal calcite 
5'80 and 81°C, respectively. Yet the mechanism teleconnecting the 
onset of major Antarctic glaciation and CCD deepening is a subject of 
debate. At least four hypotheses have been invoked to link 8'*O 
increase (glaciation), 5'°C increase and CCD deepening (carbon 


cycle perturbation) across this key interval. These are H1, an increase 
in organic carbon burial rates*''"’; H2, an increase in weathering of 
silicate rocks'*"°; H3, an increase in global siliceous (versus calcare- 
ous) plankton export production’; and H4, a shift of global CaCO; 
sedimentation from shelf to deep ocean basins*’®"”. 

To test the power of these hypotheses to explain the Eocene/ 
Oligocene transition, we have developed a biogeochemical box model 
of the global ocean (Fig. 1) with long-term control on the carbon cycle 
given by weathering processes'*. A full suite of time series are output, 
including nutrient and phytoplankton concentrations, alkalinity, dis- 
solved inorganic carbon (DIC), pH, [co; ~ | for sea water, CCD, 8PC 
of burial fluxes plus atmospheric CO). In Fig. 2 we present the para- 
meters best compared to the high-resolution data sets available: simu- 
lated CCD and 8'°C in benthic foraminiferal calcite. 

In some ways H1 represents the obverse scenario to present-day 
fossil fuel emissions—a period of rapid burial of organic carbon 
(carbon sequestration) leading to CO, extraction from the atmo- 
sphere—ocean system. Eocene/Oligocene boundary sections in the 
Southern Ocean’ show increased accumulation rates of opal and 
benthic foraminifera suggesting that elevated marine organic carbon 
burial rates contributed to ice-sheet growth via acceleration of the 
biological pump and CO, removal*”’. Higher pH associated with 
CO, drawdown would have increased [Coz] and deepened the 
CCD. 

At least three different processes have been proposed to elevate 
organic carbon burial at the Eocene/Oligocene boundary. Arguably 
the simplest (Hla) is increased nutrient supply via more vigorous 
stirring in a cooler climate with a steeper meridional temperature 
gradient and a large Antarctic ice sheet®’**°. We test H1a by a per- 
manent 3.5-fold increase in vertical mixing rates in the model (mix- 
ing between surface and middle boxes, and between middle and deep 
boxes, KSM and KMD, respectively, in Fig. 1). The simulation fails 
(Fig. 2a and b, red lines) to reproduce the proxy records by predicting 
a permanent change in benthic 8'°C and only a temporary change in 
the CCD (opposite to the observations). The latter reflects rapid 
depletion of the finite ocean nutrient inventory. The increase in 
ventilation also produces a pronounced temporary increase (by 
about 400 p.p.m.v.) in atmospheric peo, (Supplementary Fig. 1)— 
at odds with the onset of Antarctic glaciation. 

A second process (H1b) is increased efficiency of organic carbon 
burial at the sea floor’, perhaps attributable to slower bacterial acti- 
vity in a cooler climate’*. We test H1b by employing a permanent 
50% increase in the fraction of surface-produced organic carbon 
(and phosphorus) that survives to be buried. The simulation fails 
(Fig. 2a and b, green lines) by predicting a permanent change in 81°C, 
whereas observations show a recovery. Using a temporary version of 
the same perturbation fails by predicting a temporary change in both 
8'°C and CCD. 


'National Oceanography Centre, Southampton, European Way, Southampton SO14 3ZH, UK. 7GKSS-Forschungszentrum, Institute for Coastal Research, Max Planck Strafe 1, 21502, 


Geesthacht, Germany. 
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We test a third process (H1c) for increased organic carbon burial 
(increased nutrient delivery to the oceans) by supplying an additional 
20% of both phosphorus and silicate down rivers. In these runs we 
decouple the effect of this perturbation from the weathering feedback 
by delivering the extra silicate through some (unspecified) process 
that does not simultaneously deliver alkalinity to the ocean. These 
simulations produce a modest increase in 8'°C followed by a persis- 
tent decline, and a CCD change that is of the wrong sign (Fig. 2a and 
b, black lines). CCD shoaling in these runs is traceable to calcifiers 
that are also being stimulated by the influx of extra nutrients, hence 
leading to a mismatch between carbonate burial and delivery down 
rivers. 

The second hypothesis (H2) tested is that early Oligocene 
glaciation cycles enhanced silicate weathering’**, reducing atmo- 
spheric CO,. Weathering of silicate rocks is a sink for atmospheric 
CO, and a source of alkalinity to the oceans and so, potentially, might 
explain both climate cooling and CCD deepening. A natural addi- 
tional consequence of increased silicate weathering on the continents 
is elevated riverine delivery of silicate leading to increased oceanic 
diatom production (see H3). However, to isolate the direct impact of 
silicate weathering on the carbon cycle, under H2, we allow silicate 
weathering to affect ocean alkalinity but not silicate levels. Under this 
hypothesis, a strong decrease in carbonate weathering, driven by 
climate cooling, causes [CO | to fall sharply, with a consequent 
shallowing of the CCD (Fig. 2d). Sensitivity analyses using different 
injections of alkalinity, and different dependences of the two weath- 
ering processes on poo, (Supplementary Fig. 6) produced the same 
conclusion (rejection of H2). 

The increase in opal accumulation rates across the Eocene/ 
Oligocene transition in the Southern Ocean'"”” has also led to a third 
hypothesis (H3) in which the global balance shifted away from calci- 
fiers towards silicifiers’. A plausible consequence of increased diatom 
dominance of phytoplankton assemblages is decreased abundance of 
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Figure 1| Our global biogeochemical box model. The model comprises a 
three-box ocean module with an implicit sediment layer and an atmospheric 
box. The biology is modelled explicitly and includes diatoms and others 
(some of which represents calcifying phytoplankton), phosphorus and 
silicate. The distribution of DIC within the ocean is governed by physical, 
chemical and biological processes: the exchange of CO, between the 
atmosphere and the ocean, vertical advection of water, the organic carbon 
and calcium carbonate pumps and chemical speciation. Two of the processes 
reducing alkalinity are deposition of atmospheric nitrogen, and input of 
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other phytoplankton, including coccolithophores, because total pro- 
duction is limited by phosphate*'. Assuming constant phosphorus 
and carbon river inputs, decreased surface production of CaCO will 
eventually lead to an increase in [COs] and to a deepening of the 
CCD. In contrast to H2, we test this hypothesis by applying a per- 
manent increase in the silicate riverine input to favour silicifiers 
against calcifiers. To decouple this perturbation from the weathering 
feedback, the extra silicate is delivered to the ocean through some 
(unspecified) process that does not simultaneously deliver alkalinity. 
The model captures some basic structure seen in the proxy records 
(for example, a permanent deepening of the CCD), but 8'°C does not 
recover to pre-excursion levels (Fig. 2e and f, red lines) when the 
perturbation is applied permanently. Applying the mechanism as a 
temporary perturbation results in a different mismatch with the data 
(simulated CCD shows only a temporary shift). An additional simu- 
lation in which increased silicate weathering is allowed to affect both 
alkalinity and silicate simultaneously (H2 in conjunction with H3) 
also produces unrealistic results with the effects of H2 outweighing 
those of H3. 

A further hypothesis (H4) is a shift in the balance of global CaCO; 
burial from shelf to deep sea*'*!”. This hypothesis has two compo- 
nents*: glacioeustatic sea level fall’* (by ~70 m) both reduces global 
shelf area and hence shelf burial of CaCO3, and also exposes pre- 
viously submarine shelf CaCO; to weathering. Modelling of the first 
component consists of a permanent decrease in shallow CaCO; 
burial. The second component, which has a temporary effect, consists 
of eroding and discharging to the ocean the newly exposed relatively 
fast-weathering” limestones. This extra discharge (~0.04-0.25 X 
10°Gt CaCO3-C in total; peak annual flux increased by ~50% 
to 300% over baseline, depending on the 5'°C of this carbonate) 
compares to an estimated**** ~20% to 50% increase in global car- 
bonate weathering during the Last Glacial Maximum (LGM) over 
today. Intense CaCO erosion at the Eocene/Oligocene boundary is 
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riverine nitrogen. In the biogeochemistry scheme, the dashed black arrows 
represent export out from the surface ocean. The silicate and carbonate 
feedbacks are indicated with dashed red arrows. PIC, particulate inorganic 
carbon; POC, particulate organic carbon; W-,;, carbonate weathering; W.i1, 
silicate weathering; KSM and KMD, mixing between surface and middle 
boxes and between middle and deep boxes, respectively. The arrows in the 
oceanic boxes and sediments represent the various remineralization and 
sedimentation fluxes. A full list of variables and parameter values is given in 
Supplementary Tables 1 and 2. 
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based on a threefold greater Eocene inventory of shelf carbonate than 
today’’, and its likely much greater average solubility—having not 
undergone mineralogical stabilization during repeated precursor 
large-amplitude glacial lowstands. We model the Eocene/Oligocene 
input as a two-step function (Supplementary Fig. 3), in line with 
interpretations** of how the Antarctic ice sheet grew, but our find- 
ings are not sensitive to this implementation. In our ‘best run’, the 
CCD shows an overall permanent deepening with a maximum initial 
response of about 2km; and 5'°C shows a temporary increase in 
agreement with the data (Fig. 2g and h, red lines). 

We ascribe more importance to the similarity in form of the 
observed and modelled records than to absolute values for two rea- 
sons. First, the model captures the global signal, whereas the records 
come from a single site (albeit from the largest ocean). Second, the 
model is conservative in that Eocene global shelf-to-basin CaCO, 
partitioning is set to 45:55, close to the estimated present-day ratio”, 
whereas the early Cenozoic shelf CaCO; sink was proportionately 
larger’’. Accordingly, we conclude that H4 is compatible with the 
observations. A permanent shift in the CCD, towards deeper values, 


313C (%o VPDB) 


313C (%o VPDB) 


2 
g (H4) 
i 1.5 
a F 
e 

=] ei 
x 
S oss: a 
2 : 

0 

32 33 34 35 

Age (Myr ago) 


Figure 2 | Model results simulating competing hypotheses and comparison 
to available data. Carbon isotope ratios in benthic foraminifera (a, c, e and 
g) and CCD evolution (b, d, f and h) for families of runs simulating the four 
different hypotheses H1, H2, H3 and H4 compared to the data’. a, b, H1 
shows increased organic carbon burial via: a more vigorous stirring of the 
ocean (red lines), increased efficiency of organic carbon burial at the sea 
floor (green lines), and increased nutrient delivery to the ocean (black lines). 
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is induced by the requirement of the ocean to maintain the input/ 
output balance of carbonate by a shelf-to-deep-basin switch in 
deposition. A change in 81°C of the right sign and approximate 
magnitude is brought about by the weathering of extra neritic 
CaCO3;, having*® a higher isotopic composition than the baseline 
CaCO; weathering and burial fluxes, both of which have substantial 
pelagic components (Supplementary Fig. 3). 

It has been suggested’’ that increased weathering to the ocean of 
calcium ions is the critical process, via its effect on saturation state and 
the CCD. We test this alternative by re-running H4, but with [Ca**] asa 
dynamic variable rather than a constant. In this experiment all CaCO; 
fluxes affect [Ca”*] as well as DIC and alkalinity. Given the large differ- 
ence in residence times (calcium ~1 million years; DIC ~ 100,000 years) 
and concentrations (calcium ~ 10.6 mmol; DIC ~2.3 mmol), the addi- 
tion of calcium produces a negligible difference to the model results (the 
green lines in Fig. 2g and h almost directly overlie their red line counter- 
parts). We therefore discount this alternative. 

The 5'°C and CaCO; MAR records from ODP Site 1218 provide 
strong constraints on the biogeochemistry of Eocene/Oligocene 
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c, d, H2 shows enhanced weathering of silicate rocks (red lines). e, f, H3 
shows silicifiers outcompeting calcifiers (red lines). g, h, H4 shows changes 
in shelf to deep basin CaCO; partitioning and erosion of newly exposed 
CaCO; as a result of sea-level fall, affecting the seawater saturation state via 
dumping of additional carbon to the ocean (red lines; we consider the thick 
red line to be our ‘best run’), and via dumping of both carbon and calcium in 
a 1:1 ratio (green line). Extra calcium was added only to the ‘best run’. 
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boundary events: a temporary increase in 8'°C followed by a full 
recovery, and a large deepening in the CCD, followed by only partial 
recovery such that a permanent deepening in the CCD of order 1 km 
is sustained. Any hypothesis must simultaneously account for both 
the permanent and the temporary changes. Only H4 is found to do 
this, when applied to our model. 

Glacioeustatic sea-level fall associated with the growth of the 
Antarctic ice sheet exposes widespread limestones to erosion; this 
leads to a one-off ‘dump’ of dissolved inorganic carbon and alkalinity 
into the ocean, increasing ico. 1, deepening the CCD and increas- 
ing seawater 8'°C. The reduction in shelf habitat for neritic benthic 
calcifiers leads to a large reduction in CaCO; burial state in shallow 
seas, with superimposed shorter term fluctuations as the East 
Antarctic ice sheet varies in size in response to orbital forcing***. 
Because river inputs of dissolved CaCO; must match global CaCO; 
burial’, the decline in shallow-sea burial forces a compensating 
increase in deep-sea burial through a shift in state to a deeper CCD. 

In nature, we might expect shelf to basin CaCO; fractionation to 
act in concert with other processes but additional runs to test H4 in 
combination with H1, H2 and H3 each resulted in poorer simula- 
tions of the high-resolution data sets available (Supplementary Fig. 
2). Improved pco, records are needed. Our simulations show that, 
acting alone, H4 brings about a large but transient pco, decline 
(Supplementary Fig. 1). If CCD deepening at the Eocene/Oligocene 
boundary marks a permanent shift to a lower poo, regime then addi- 
tional processes are indicated. Either way, our findings in support ofa 
sea-level-led hypothesis (H4) lend weight to the view® that rapid 
carbon-cycle perturbation associated with CCD deepening at the 
Eocene/Oligocene boundary did not trigger Antarctic glaciation. 
Instead, rapid glaciation, itself driven by slow long-term CO, draw- 
down via silicate weathering* and orbital forcing’ (inhibiting warm 
Antarctic summers), is capable of triggering a chain of carbon cycle 
responses, including CCD deepening, 5'°C perturbation and assoc- 
iated further (ice-sheet-stabilizing) rapid poo, drawdown, cooling 
and aridification””. 


METHODS SUMMARY 


Our biogeochemical box model of the global ocean includes two phytoplankton 
groups: diatoms and others (the latter including calcifiers) and the cycling of 
silicate, phosphorus, DIC and alkalinity (Fig. 1). The model simulates dynamic 
changes in the CCD and ocean carbon chemistry is linked, through air-sea 
gas exchange, with atmospheric CO}. Both '*C and '°C are simulated in each 
reservoir and flux. Long-term control on the carbon cycle is given by weathering 
processes'*. A full list of variables and parameter values is given in Supple- 
mentary Tables 1 and 2. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Description of the model. The ocean component, developed from previous 
work'**', includes three vertically stacked boxes: the surface (0-100 m), which 
represents the euphotic zone, a middle box (100-500 m), which represents the 
mixed surface layer above the annual thermocline, and a deep box (500- 
3,730 m), which represents the deep layer below the annual thermocline. The 
model represents an average water column down to the seabed, has a spatially 
and temporally averaged input of nutrients, DIC and alkalinity, and does not 
take into account any latitudinal or horizontal variations. The distribution of 
DIC within the ocean is governed by physical, chemical and biological processes: 
the exchange of CO, between the atmosphere and the ocean, riverine input of 
DIC, biological uptake of CO, into biomass, remineralization and burial of 
biomass, precipitation and dissolution of CaCO; and mixing processes between 
the three layers. The processes governing the distribution of alkalinity in the 
ocean are: riverine input of bicarbonate and nitrate, precipitation of CaCO, by 
organisms and the biological uptake of nitrate, the dissolution of CaCO; deeper 
in the ocean and the remineralization of nitrate, burial of CaCO; and mixing 
processes between the three layers. The production of CaCO; in the surface 
ocean is linked to the production of organic matter through the ‘rain ratio’, 
which is the molar ratio of particulate inorganic carbon (PIC) export from the 
surface layer to particulate organic carbon (POC) export. 

The silicate and carbonate weathering processes are modelled (according to 
ref. 18) using: Wear = fear X (Poo, / Poo) ANA Wein = fil X (Poo, / PCO)?» Wear 
and W,, are normalized to the initial peo, value: PCO. (ni): INput terms to the 
atmosphere also include a weathering process, the oxidation of fossil POC from 
rocks (kerogenic input), as well as CO, from volcanic outgassing (volcanic 
input). 

The model includes the calculation of the dynamic CCD and aragonite com- 
pensation depth (ACD). Ocean hypsometry was obtained from the ETOPO5 
data set at 5 X 5 min resolution across the global ocean. The relative fractions of 
ocean area at different depths were calculated by summing over the data set. To 
prevent unrepresentative biasing towards polar latitudes, each grid cell was 
weighted by the cosine of its latitude. The critical carbonate ion concentration 
([CO}~].,;,) below which sea water is undersaturated is a function of depth 
and is dependent on the crystal structure, specifically, calcite (mainly foramini- 
fera and coccolithophores) or aragonite (mainly neritic calcifiers). The model 
calculates deep carbonate ion concentration, and the following equations” (for 
calcite and aragonite respectively) [CO3~ ]_,;,(z) = 88.7exp[0.189(z—3.82)] and 
[CO3~ ],i¢(Z) = 117-5exp[0.176(z —3.06)] are used to calculate the depth at 
which [CO3~].,;,=[CO3~]. The ocean hypsometry is then used to calculate 
the fraction of ocean area above or below the CCD and the ACD, and hence 
the proportions of the sinking CaCO; flux that are buried or dissolved. 

The rate of CaCO; dissolution in the model is calculated solely as the product 
of the CaCO; sinking flux and the fraction of sea floor receptive to CaCO; 
burial. Previous carbon cycle studies**°** have included chemical erosion 
and/or respiration-driven dissolution. However, although chemical erosion of 
previously deposited CaCO; in the bioturbated zone must be considered for an 
ocean becoming increasingly acidic, it is not crucial for this study, in which the 
CCD is deepening. Respiratory dissolution does not generally decouple the 
lysocline from the saturation horizon™ and so was not included. Following 
ref. 29, dissolution of CaCO; is modelled without explicitly modelling sediment 
chemistry. 
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Stable isotope composition of the shell calcite of most species of 
foraminifera reveals offsets from calcite precipitated in equilibrium with 
the ambient bottom water”. These offsets have been attributed to so-called 
vital effects. Evidence from culture experiments*® suggests a significant effect 
of seawater carbonate ion concentration on the stable isotope composition 
of planktic foraminifera. This aspect was also taken into account in our 
model by using®*: 5! Cyne =5' Cpic — 0.008( [coz] —300). We also include 
the CO, effect on the 8°C of POC by using the following”: 
08 Cong = 5 Cyc — 9866 / Tx + 24.12 — 17 log jo ([COo(aq)]) +3-4- 

We note that whereas the model is based on a rather simple physical scheme of 

the global ocean (three vertical boxes), a previous study* (using a complex 
general circulation model with coupled components for atmosphere, ocean, 
ice sheet and sediment) has shown that physical processes (driven for instance 
by changes in continental configuration) had only a secondary role in the 
Eocene/Oligocene climate transition, compared to the processes controlling 
CO, concentration. A sensitivity analysis investigation suggests that the results 
obtained here are insensitive to model simplifications and assumptions 
(Supplementary Information). 
Steady-state and implementation of perturbations. The initial time of the 
simulations is the late Eocene (35 Myr ago). The model is run for 3 Myr across 
the Eocene/Oligocene boundary up to the early Oligocene (32 Myr ago). We start 
the model in the steady state, then apply a perturbation in accordance with each 
hypothesis at the Eocene/Oligocene boundary. As part of the sensitivity analysis, 
we generated a family of runs for each hypothesis by changing parameter values. 
We choose to establish the initial steady state at an atmospheric pco, of 
1,000 p.p.m.v. (Peo, (ini))> in keeping with proxy records'®. In accordance with 
the data available’ from the Pacific (‘global’) ocean, this steady-state is charac- 
terized by 5'°C in benthic foraminifera of +0.7%o VPDB (Vienna Pee-Dee 
Belemnite standard) and depths for the CCD and ACD of about 3km and 
0.7 km, respectively. Major fluxes resulting from this baseline steady state are 
reported in Supplementary Table 3. 
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Hydrous silicate melt at high pressure 


Mainak Mookherjee’, Lars Stixrude* & Bijaya Karki? 


The structure and physical properties of hydrous silicate melts and 
the solubility of water in melts over most of the pressure regime of 
Earth’s mantle (up to 136 GPa) remain unknown. At low pressure 
(up to a few gigapascals) the solubility of water increases rapidly 
with increasing pressure’, and water has a large influence on the 
solidus temperature, density’, viscosity’ and electrical conduc- 
tivity. Here we report the results of first-principles molecular 
dynamics simulations of hydrous MgSiO; melt. These show that 
pressure has a profound influence on speciation of the water com- 
ponent, which changes from being dominated by hydroxyls and 
water molecules at low pressure* to extended structures at high 
pressure. We link this change in structure to our finding that the 
water-silicate system becomes increasingly ideal at high pressure: 
we find complete miscibility of water and silicate melt throughout 
almost the entire mantle pressure regime. On the basis of our 
results, we argue that a buoyantly stable melt at the base of the 
upper mantle would contain approximately 3 wt% water and have 
an electrical conductivity of 18$m', and should therefore be 
detectable by means of electromagnetic sounding. 

The hydrogen bond is not well described by the dominantly ionic, 
atomistic models that have most often been applied to the study of 
silicate melts. First-principles molecular dynamics simulations are 
more costly in terms of computer time but have the important 
advantage of making no a priori assumptions regarding the nature 
of the bonding or the shape of the charge density. Density functional 
theory, on which our simulations are based, has been successfully 
applied to the study of a number of hydrous crystalline silicates and 
oxides’ and to the liquid SiO,-H,O system at ambient pressure’®. 

In our first-principles molecular dynamics simulations of a hyd- 
rous MgSiO3-H,0 liquid with 10 wt% water, the electronic structure, 
the Hellman—Feynman forces acting on the nuclei, and thermodyn- 
amic properties are computed at each time step in the local density 
and pseudopotential approximations. Atomic trajectories extracted 
from the simulations reveal the structure, speciation and dynamics of 
the melt. To quantify the influence of the water component on 
physical properties, we compare our results with a previous first- 
principles molecular dynamics study of anhydrous MgSiO; liquid’. 

Inspection of the equilibrated liquid structure shows that pressure 
has a large influence on the speciation of the water component 
(Fig. 1). Whereas at low pressure we find mostly hydroxyls and 
water molecules, at high pressure we see a much greater variety of 
species. These include Si-O—H-O-Si polyhedral linkages, -O—H- 
O-H- chains and O-H-O edge decoration of SiO, octahedra, the 
last of which was suggested in a previous experimental study of 
glasses*. We note another feature at low pressure that had not been 
anticipated experimentally: a strong preference for water molecules 
to bond to Mg”* cations. This chemical association, together with the 
decrease in the proportion of hydroxyls with increasing pressure, 
may explain the increase in the MgO/SiO) ratio of eutectic melts in 
the MgO-SiO,-H,O system with increasing pressure up to 15 GPa 
(ref. 9). 


The water component becomes more interconnected with increas- 
ing pressure, transforming from isolated molecular species at low 
pressure to a structure more reminiscent of bulk phases of water at 
higher pressure (Fig. 2a). The H—O coordination number varies in 
value from almost one at low pressure to two at the highest pressures 
investigated. This coordination number is similar to that in the high- 
pressure form of water’? and in ice in which the structure is a com- 
plete three-dimensional network of nearly symmetric hydrogen 
bonds (ice X). Simultaneously, as compression increases, the very 
open silica framework gives way to a much more densely packed 
structure in which little free volume remains to accommodate highly 
polar molecular species: molecules are squeezed out as pressure 
increases, and are replaced by extended structures. 

The addition of water disrupts the intermediate-range silicate 
structure: the O-Si coordination number of the hydrous melt is 
substantially less than that of the anhydrous melt, demonstrating 
that added water disrupts interpolyhedral linkages (Fig. 2b). Such 
disruption is consistent with experimental evidence at low pressure 
for the decrease in viscosity with increasing water content in silicate 
melts. Because we find water-induced disruption of the silicate net- 
work at all pressures, we anticipate that addition of water will 
decrease the viscosity of silicate melts at high pressure as well. In 
contrast, the Si-O coordination number in hydrous and anhydrous 
melts is identical at all pressures. Mg—O coordination numbers are 
similarly unaffected by the addition of water at pressures greater than 
20 GPa; at lower pressures the addition of water decreases the Mg—O 
coordination number through the interaction of water molecules 
with Mg cations. 

We find that protons are highly mobile, presenting a possible 
means of geophysical detection of deep melt. The diffusivity of 
hydrogen is an order of magnitude greater than that of the other 
ions. From our simulations, D, the self-diffusion coefficient of hydro- 
gen, is adequately described by the Arrhenius relation (Fig. 3) 


E* + PV* 
—aes 1 
) (1) 


where E* denotes the activation energy, V* denotes the activation 
volume, T denotes the temperature, P denotes the pressure, Do 
denotes the value of D in the limit of infinite temperature and k 
denotes the Boltzmann constant. The electrical conductivity ¢ due 
to charged mobile species is given by the Nernst—Einstein relation 


Dxq? 

° ETH 2) 
where xis the number of carriers per unit volume, the Haven ratio Hz 
approaches a value of one for small values of x (ref. 11) and q is the 
electrical charge of the carrier. 

Assuming that protons are the primary charge carriers and that 
Hg = 1, from equations (1) and (2) we estimate a conductivity of 
59c/10Sm_' under conditions thought to represent the mantle at a 
depth of 410 km (that is, a pressure of 14 GPa and a temperature of 


D=D exp(— 


'Department of Geology and Geophysics, Yale University, New Haven, Connecticut 06511, USA. “Department of Earth Sciences, University College London, Gower Street, London 
WCIE 6BT, UK. ?Department of Computer Science and Department of Geology and Geophysics, Louisiana State University, Baton Rouge, Louisiana 70803, USA. 
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1,800 K), where cis the percentage mass fraction of water in the melt. 
This corresponds to approximately 18S m7! for a neutrally buoyant 
melt, which at this depth has c= 3 wt% (see below). The electrical 


Figure 1| Structure and speciation of hydrous silicate melt. a, 3,000 K, 
V/V, = 1; b, 3,000 K, V/V, = 0.5. Red, yellow, green and cyan polyhedra 
represent four-, five-, six- and sevenfold Si-O coordination environments, 
respectively. Yellow spheres represent Mg atoms. Other spheres represent H 
atoms (small spheres) and those O atoms (large spheres) not solely bonded 
to Si; they are coloured pink if they are bonded to at least two other H atoms 
or O atoms, and are coloured blue if they are bonded to only one other H or 
O atom. Red ellipses highlight species discussed in the text, including 
hydroxyls (O-H in a), water molecules bound to Mg ions (Mg—O-H) in 
a), interpolyhedral bridges (Si-O—H-—O-Si in b), polyhedral edge decoration 
(O-H-O in b) and a Sic-O-H—O-H-O-Si interpolyhedral bridge (in b). The 
apparent equilibrium constant of the reaction* H,O + O= 20H at 3,000K 
and near-ambient pressure computed from our simulations is 3.1, which is 
in close agreement with the value (2.3) extrapolated from experimental 
measurements on rhyolite compositions” at much lower temperatures and 
water contents, and is consistent with the experimentally observed tendency 
of water molecules increasingly to dissociate in the melt as temperature 
increases. The molecular dynamics snapshots were rendered using the 
visualization system of Bhattarai et al.”*. In each panel, the black outline 
represents the primary simulation cell. 
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conductivity of hydrous silicate melt has not been measured at pres- 
sures greater than 1 GPa. Because the expected water concentration 
in the solid mantle on either side of a hydrous melt layer is much less 
than 1%, the melt layer would represent a large conductance anomaly 
(of 18,000 S, assuming that o = 18S m | and there is a 20-km-thick 
partial melt layer with 5% neutrally buoyant hydrous melt'’) that 
should be detectable by means of electromagnetic sounding. Indeed, 
recent inversions of electromagnetic data, although non-unique, fit a 
value of total conductance similar to the values we find’. 

The equation of state of the hydrous liquid, like that of the anhyd- 
rous liquid, differs from that of most crystalline mantle phases in an 
important way: the thermal pressure increases markedly on compres- 
sion (Fig. 4). The hydrous liquid equation of state can be described by 
the Mie—Griineisen form 


PV,T)=PAV.T)+ 2 C(T—T) 


where P. denotes the reference isotherm, taken to lie at T, = 3,000 K, 
y denotes the Griineisen parameter and C, denotes the isochoric heat 
capacity. These quantities are obtained from the simulation: as the 
internal energy Eand pressure P vary linearly with temperature along 
isochores (to within our uncertainty), the values are calculated at 
each volume from y= V(@P/0E),, and C, = (@E/0T),. The isocho- 
ric heat capacity decreases by about 8% upon twofold compression, 
from 3.63 + 0.21 Nk to 3.34 + 0.11 Nk, where N is the number of 
atoms. The value of y increases by a factor of three over the same 
compression range. This behaviour is opposite to that of mantle 
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Figure 2 | H-O and O-Si coordination numbers. a, Fraction of protons 
bonded to one (open circles), two (squares) and three (triangles) O atoms, 
and the mean H—-O coordination number Z(H—O) (solid line without 
symbols) at 4,000 K. The simulated pressure at 4,000 K is indicated along the 
top axis. b, Evolution of O-Si coordination number upon compression, for 
anhydrous’ (filled symbols) and hydrous (open symbols) melts at 3,000 K 
(blue), 4,000 K (green) and 6,000 K (red). Black lines represent temperature- 
averaged values. 
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crystalline phases and similar to that of anhydrous MgSiO; liquid’: 
within our uncertainty, the values of y for hydrous and anhydrous 
melts are identical over the entire volume range explored. 

The density of the hydrous melt is less than that of the anhydrous 
melt under all conditions explored here (Fig. 4a), but the density 
contrast is small enough that hydrous melts may be neutrally buoyant 
in the mantle, providing a possible explanation of a low-velocity layer 
found on top of the 410-km discontinuity'’*. The density contrast is 
remarkably constant across the pressure-temperature range of the 
mantle, varying from 0.3 to 0.4gcm ° between 1 and 100 GPa and 
3,000 and 6,000 K. We estimate the amount of water that would 
produce a neutrally buoyant melt at a depth of 410km as 
c/10 = (Paro — Panhydrous)/APH,0> where P410 = 3.54 gem ~ is the 
density at the base of Earth’s upper mantle", Panhydrous = 3-66 & cm? 
is the density of a partial melt of anhydrous peridotite’’ at 13.4 GPa 
and 1,873 K, and Apy,o = —0.35 g cm? is the density contrast that 
we find for 10 wt% water. We find from this analysis that c= 3 wt%, 
consistent with recent estimates’®*'”. 

The partial molar volume of water in the silicate melt, Vi,o, 
asymptotically approaches that of pure water, Vy,0, as pressure 
increases (Fig. 4b). At the highest pressure that we studied, Vi,0 
and Vy, are identical to within our uncertainty. This means that 
the volume of solution AV, which is large and negative at low pres- 
sure, approaches the ideal limit (AV— 0) as pressure increases. The 
increasing similarity of the volumes of water in the melt and in pure 
form is consistent with our finding that the water component 
becomes more structurally interconnected and more like bulk water 
with increasing pressure. An important consequence of the increas- 
ing ideality of the water-silicate solution with increasing pressure is 
the almost constant density of hydration that we find (Fig. 4a): if 
AV= 0 at all pressures, the density contrast between hydrous and 
anhydrous melts varies rapidly with pressure at low pressure. 


10-6 
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Figure 3 | Self-diffusion coefficient of hydrogen. Results at 3,000 K (blue), 
4,000 K (green) and 6,000 K (red) from our simulations (symbols) and an 
Arrhenius fit to the simulation results (lines) (equation (1) with 

Dy = 9.2 X 10-7 m’s_!, E* = 86kJ mol _! and V* = 0.11 cm? mol _!). The 
value of the activation energy is similar to that found in hydrous basaltic 
melts at low pressure”? (126 + 32 kJ mol '), and the value of the diffusivity 
agrees well with that extrapolated to 3,000 K from lower temperature, low- 
pressure experiments” (2.4 X 10 *m’s '), indicating that proton diffusion 
does not depend strongly on melt composition. Error bars represent one-s.d. 
uncertainties. 
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We predict that the solubility of water in silicate melt will increase 
as pressure increases and will be essentially unlimited at all mantle 
pressure—temperature conditions beyond a few gigapascals. This pre- 
diction is based on our finding that the water component behaves 
increasingly ideally as pressure increases. The volume of solution 
AV = d(AG)/dP is <0 at all pressures investigated, so the Gibbs free 
energy of solution AG, which limits solubility, must decrease to a 
small, pressure-independent value as pressure increases to around 
100 GPa, where AV vanishes. As AG is sufficiently small to permit 
complete solubility at 12 GPa (ref. 18), solubility must remain undi- 
minished as pressure increases to at least the highest pressure we 
study here (100 GPa). The dependence of Vy,o on silicate composi- 
tion remains to be investigated at high pressure. However, we note 
that Vi,0 is apparently very insensitive to silicate composition at low 
pressure’, indicating that our arguments regarding solubility may be 
applicable to natural melt compositions as well. 

Essentially unlimited solubility of water in silicate melt over most 
of the mantle regime would have potentially important implications 
for our understanding of Earth’s origins. In most models of the 
accreting Earth, a deep magma ocean, possibly encompassing the 
entire mantle, is an important reservoir of water. Our results show 
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Figure 4 | Density and partial molar volume of water. a, Comparison of the 
density of hydrous (solid lines with symbols) and anhydrous’ (dashed lines) 
melts along 3,000 K (blue) and 6,000 K (red) isotherms. The inset shows the 
density contrast between hydrous and anhydrous melt along 3,000 K (blue) 
and 6,000 K (red) isotherms. For comparison, the grey line shows the density 
contrast between an anhydrous melt and an ideal hydrous melt (AV = 0) at 
3,000 K. b, Partial molar volume of water from our simulations at 6,000 K 
(red circles) and 3,000 K (blue circles), compared with that of pure water*® 
(solid lines). The partial molar volumes of water derived from high-pressure 
experiments (upward triangle’® and downward triangles’’) are shown for 
comparison. The dashed line represents the partial molar volume from this 
study extrapolated to 2,200 K, the experimental temperature in refs 16 and 
17. The inset shows the partial molar volume of water (open circles) that we 
calculate extrapolated along the isochore V/V, = 1 to zero pressure (dashed 
line), in excellent agreement with experiment (solid square’). Error bars 
represent one-s.d. uncertainties. 
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that this reservoir could easily store all the water delivered to Earth 
(probably bounded by the 5—7% by mass present in CI carbonaceous 
chondrites’), which probably exceeds the amount of water in the 
present Earth (0.005-0.02% by mass in the source of mid-ocean- 
ridge basalts*’) by a large factor. Moreover, water stored in the 
magma ocean may have fundamentally altered its thickness, thermal 
state and dynamics, by means of water-induced lowering of the 
solidus and the viscosity. Extensive solubility of water in the early 
magma ocean may have had important implications for the origin of 
the hydrosphere”’. 


METHODS SUMMARY 


The simulations were performed in the canonical ensemble with periodic 
boundary conditions and a Nosé thermostat”’. The primary cell contained 84 
atoms (12 MgSiO; molecules and eight water molecules: 10 wt% water). As our 
initial condition, a pyroxene structure with water molecules placed in the vacant 
space between tetrahedral chains and in the b-c plane was homogeneously 
strained to a cubic shape and the desired volume. The system was melted at 
6,000 K and then cooled isochorically to first 4,000 K and then 3,000 K. We used a 
time step of 0.5 fs. Total run durations were 3 ps at 6,000K, 4.5 ps at 4,000 K 
(except for V/V, values of 0.5 and 0.6, for which the duration was 6 ps) and 6 ps at 
3,000 K (except for a V/V, value of 0.6, for which the duration was 12.5 ps, and a 
V/V, value of 0.5, for which the duration was 15 ps). In all cases the final 80% of 
the trajectories were used to compute averages. 

Self-diffusion coefficients were determined from the slope of the linear 
portion of the plot of mean squared displacement versus time (see the 
Supplementary Information). We confirmed that the simulated properties were 
not significantly affected if we doubled the cell size or used a different initial 
condition. We used ultrasoft pseudopotentials” and VASP”, computing the 
electronic structure at the Brillouin zone centre with an energy cutoff of 
400 eV. Pulay corrections to the pressure were estimated to be the same as in 
the anhydrous case’, varying linearly from 2 GPa at V= V,, to 5 GPa at V= V,/2, 
where V,. = 1,033.54 Ab per simulation cell (84 atoms), identical to the reference 
volume of our anhydrous simulations. To account for the error inherent in the 
approximation to the exchange correlation functional, we added a uniform 
correction of 2GPa to the pressure’. The simulations were confined to the 
Born—Oppenheimer surface and include the influence of the finite temperature 
through the Mermin functional”*”*. 
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Mountain pine beetle and forest carbon feedback to 


climate change 


W. A. Kurz’, C. C. Dymond’, G. Stinson’, G. J. Rampley’, E. T. Neilson’, A. L. Carroll’, T. Ebata” & L. Safranyik’ 


The mountain pine beetle (Dendroctonus ponderosae Hopkins, 
Coleoptera: Curculionidae, Scolytinae) is a native insect of the 
pine forests of western North America, and its populations peri- 
odically erupt into large-scale outbreaks’. During outbreaks, the 
resulting widespread tree mortality reduces forest carbon uptake 
and increases future emissions from the decay of killed trees. The 
impacts of insects on forest carbon dynamics, however, are gene- 
rally ignored in large-scale modelling analyses. The current out- 
break in British Columbia, Canada, is an order of magnitude 
larger in area and severity than all previous recorded outbreaks’. 
Here we estimate that the cumulative impact of the beetle out- 
break in the affected region during 2000-2020 will be 270 mega- 
tonnes (Mt) carbon (or 36gcarbonm “yr ' on average over 
374,000 kin” of forest). This impact converted the forest from a 
small net carbon sink to a large net carbon source both during and 
immediately after the outbreak. In the worst year, the impacts 
resulting from the beetle outbreak in British Columbia were equi- 
valent to ~75% of the average annual direct forest fire emissions 
from all of Canada during 1959-1999. The resulting reduction in 
net primary production was of similar magnitude to increases 
observed during the 1980s and 1990s as a result of global change’. 
Climate change has contributed to the unprecedented extent and 
severity of this outbreak®. Insect outbreaks such as this represent 
an important mechanism by which climate change may under- 
mine the ability of northern forests to take up and store atmo- 
spheric carbon, and such impacts should be accounted for in 
large-scale modelling analyses. 

Forest insect epidemics can have severe impacts on ecosystem 
dynamics by causing mortality and reducing the growth of millions 
of trees over extensive areas’. Native insects and alien invasive species 
affect both managed and natural forests. Beyond the ecological 
impacts are the associated economic (for example, disrupted timber 
supply to mills) and social (for example, unemployment, crime rates) 
effects*. The impact of insects on carbon (C) dynamics and global 
climate are not well documented’. 

The current outbreak of mountain pine beetle in western Canada is 
an order of magnitude greater in area than previous outbreaks owing 
to the increased area of susceptible host (mature pine stands) and 
favourable climate* (see also Supplementary Fig. 3). An expansion in 
climatically suitable habitat for the mountain pine beetle, including 
reduced minimum winter temperature, increased summer tempera- 
tures and reduced summer precipitation, during recent decades has 
facilitated expansion of the outbreak northward and into higher 
elevation forests*’®. This range expansion, combined with an increase 
in the extent of the host, has resulted in an outbreak of unprece- 
dented scale and severity. By the end of 2006, the cumulative out- 
break area was 130,000 km? (many stands being attacked in multiple 
years), with tree mortality ranging from single trees to most of a 


stand in a single year''. Timber losses are estimated to be more 
than 435 million m°, with additional losses outside the commercial 
forest'*. The forest sector has responded by increasing harvest rates 
and reallocating some harvest, increasing the pine portion of the 
provincial total volume harvested from 31% to 45% over four years 
(2001-2004). 

We estimated the combined impact of the beetle, forest fires and 
harvesting on forest productivity and carbon balance from 2000 until 
2020 for the south-central region of British Columbia (Fig. 1). This 
area includes 374,000 km’ of productive forest, largely dominated by 
pine (Pinus) and spruce (Picea) species. We used a Monte Carlo 
design for simulating future net biome production (NBP) using a 
forest ecosystem model (the Carbon Budget Model of the Canadian 
Forest Sector, CBM-CFS3). This model accounts for annual tree 
growth, litterfall, turnover and decay, and it explicitly simulates 
harvest, beetle-caused mortality, and fire-caused mortality and fuel 
consumption. We developed regional probability distribution func- 
tions of the annual area burned and projected future beetle dynamics 
on the basis of the characteristics of the remaining host (that is, pine 
stands of suitable age) and the judgement of regional entomologists. 
We conducted 100 Monte Carlo simulations with different random 
draws from the probability distributions for the annual area of beetle 
outbreak and the annual area burned. 

For the period 2000-2020, the average annual NBP was —15.8 + 
7.9MtCyr | (or —42.4+21gCm “yr '; Fig. 2). Carbon losses 
result from emissions from decomposition and fires and from 
the transfer of timber to the forest product sector. In a separate 
analysis'’, we estimated that the study area was a net sink from 
1990 to 2002. The first two years of this study also reported a net 
sink (0.59 Mt Cyr’ '), but with increasing beetle impact (Fig. 3), the 
forest converted to a source of 17.6MtCyr ‘from 2003 to 2020. 
With decreasing beetle impact (Fig. 3), NBP began to recover, but 
by 2020, the estimated NBP had not yet returned to pre-outbreak 
levels. Although we can expect that forests will eventually recover 
from the beetle outbreak, we are reluctant to extend projections 
beyond 2020 or to speculate on the rate of recovery beyond 2020 
given uncertainties about non-host responses, rates of regeneration, 
and future fires in a region in which major climate change impacts are 
forecast'*. 

One component of the uncertainty in future NBP is that we do not 
know the future area that will be infested by the beetle. We projected 
the area infested during 2007-2020 using random draws from 
regionally calibrated probability distributions of outbreak area and 
duration that were based on: the 2000-2006 area; mortality and host 
statistics; historical, spatial and temporal dynamics; remaining host 
population; and judgment from entomologists. The outbreak was 
projected to peak between 2006 and 2008, with the maximum area 
infested ranging from 74,000 km” to 94,000 km” (Fig. 3). 


'Natural Resources Canada, Canadian Forest Service, Pacific Forestry Centre, Victoria, British Columbia, V8Z 1M5, Canada. British Columbia Ministry of Forests and Range, Victoria, 


British Columbia, V8W 9C2, Canada. 
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Figure 1| Geographic extent of mountain pine beetle outbreak in North 
America. a, Extent (dark red) of mountain pine beetle. b, The study area 
includes 98% of the current outbreak area. c, A photograph taken in 2006 


We partitioned the impact of the beetle outbreak on NBP using a 
single projection of beetle (Fig. 3) and fire area and by simulating 
three forest carbon dynamics scenarios: without beetle (control), 
with beetle but no additional harvest, and with beetle and additional 
harvest. The additional harvest is considered to be part of the beetle 
impact because it is a management response to the vast quantities of 
beetle-killed trees. The impact of beetles on NBP was projected to 
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Figure 2 | Annual NBP by percentile from the Monte Carlo simulations. 
The model results are based on statistics from 2000 to 2006 and projections 
from 2007 to 2020. Negative values represent fluxes from the forest to the 
atmosphere (a net source of carbon). Asymmetry of the range of estimates of 
NBP in any single year is a function of the area burned and the associated 
direct carbon emissions. 
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showing an example of recent mortality: pine trees turn red in the first year 
after beetle kill, and grey in subsequent years. Photo credit: Joan Westfall, 
Entopath Management Ltd. 


reach —20MtCyr ' (—53gCm “yr _') in 2009 (Fig. 4). Cumu- 
lative beetle impact over the 21-yr simulation period was projected 
to be —270 MtC, and the additional salvage harvest and removal of 
both beetle-killed and living trees contributed a further —50 Mt C. 
Cumulative transfer of beetle-killed timber to the forest product 
sector was projected to be 31MtC (13% of total harvest). 
Harvesting results in a loss of carbon from the ecosystem, but only 
some of that carbon is emitted to the atmosphere; the remainder is 
stored in wood products and landfills. 

NBP was reduced in the affected landscape by the beetle and the 
associated additional harvest response because of reduced net 
primary production (NPP) and increased heterotrophic respiration. 
Annual NPP dropped from 440 to 400gCm in 2000-2009. The 
lowest NPP values (391 gCm ”) were estimated for 2015 to 2018, 
at which point NPP started to recover. NPP was reduced in the 
affected landscape during and after the beetle outbreak because of 
the reduction in photosynthetic capacity caused by widespread tree 
mortality. Conversely, heterotrophic respiration increased from 408 
to 424gCm ~” in the 2003-2007 period because of the large transfers 
of biomass to dead organic matter pools (471 Mt C transferred by the 
beetle during the outbreak) and subsequent decomposition. 

Impacts of the current beetle outbreak are important because they 
are of comparable magnitude to other global change factors. Fire 
emissions vary widely between years, but within the study area the 
worst year for fires in the simulation period produced direct emis- 
sions of 13 MtC, an amount exceeded by annual beetle-caused 
impacts from 2005 to 2014. The maximum annual beetle impact 
(20 Mt Cyr ') for the relatively small study area is of similar mag- 
nitude to the direct forest fire emissions from all of Canada during 
1959-1999 (27 Mt Cyr_')'®. Climate warming, elevated atmospheric 
CO, concentrations, and atmospheric nitrogen deposition may 
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Figure 3 | Area infested with the mountain pine beetle during the 
simulation period. Statistics are used from 2000 to 2006 and projections are 
used from 2007 to 2020. a, Percentiles describing the parameter space 
resulting from 100 Monte Carlo projections of the beetle-infested area. Our 
projections of decreasing area after 2009 are largely based on the decline in 
available live host (pine) area. b, Area statistics for the single Monte Carlo 
simulation used for scenario analysis broken down by host mortality class. 


enhance forest carbon sinks'®'”. For example, it was estimated in ref. 
5 that NPP increased 6% during the 1980s and 1990s as a result of 
global change. The impact of the beetle outbreak would negate these 
gains within the affected region because NPP was reduced by 10% 
and heterotrophic respiration increased by 6%. Here we examined 
just one insect outbreak and estimated that its net greenhouse gas 
impacts over 21 yr (990 Mt carbon dioxide equivalents, CO e) are 
comparable to about 5 yr of emissions from Canada’s transportation 
sector (200 Mt COze in 2005)”. 

Climate change will have impacts on insect distribution and 
abundance either directly via effects on their life cycles or indirectly 
via host—plant defences, the abundance of natural enemies or inter- 
actions with competitors'*. For some forest insects, the outcome may 
be changes in: outbreak frequencies or duration; rates of herbivory 
and damage; ranges, and associations with host species ''’’. Evidence 
for climate-change-related increases in the extent or severity of forest 
insect disturbance has begun to accumulate. In north-western North 
America, warmer temperatures have halved the time required to 
reproduce for the spruce beetle (Dendroctonus rufipennis Kirby, 
Coleoptera: Curculionidae, Scolytinae) and have contributed to 
unprecedented damage to spruce forests”. In the western United 
States, warmer annual temperatures have caused an altitudinal shift 
in the range of habitats, allowing the mountain pine beetle to invade 
high-elevation pine forests*’. Insect impacts have, however, generally 
been ignored in large-scale carbon budget modelling studies*””* 
because data on insect impacts covering large areas are expensive 
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Figure 4 | Total ecosystem carbon stock change for three scenarios. The 
control simulation was run with no beetle outbreak, and with base harvest 
and fires. The beetle simulation added insect impacts to the control scenario. 
The additional harvest simulation added the management response of 
increased harvest levels from 2006 to 2016 to the beetle simulation. Negative 
ecosystem carbon stock change values represent fluxes from the forest to the 
atmosphere (net source of carbon). The source in 2003 was, in part, the 
result of the large area burned (2,440 km? in the study area) that was 
included in all three scenarios. 


to collect and are therefore not widely available. Moreover, complex 
modelling approaches are required to adequately represent the 
impacts of insects on stand- and landscape-level forest ecosystem 
dynamics. This failure to account for insect impacts may have 
resulted in overestimation in previous studies of the potential for 
forests to offset anthropogenic CO). 

Other insects, such as eastern spruce budworm (Choristoneura 
fumiferana Clemens, Lepidoptera: Tortricidae) and forest tent cater- 
pillar (Malacosoma disstria Hubner, Lepidoptera: Lasiocampidae) 
also exhibit outbreaks during which they affect forest carbon 
dynamics by reducing growth and increasing tree mortality over large 
areas’. Other forms of disturbance, such as fires, also markedly 
influence terrestrial carbon budgets”®. Significant climate warming 
has already allowed the mountain pine beetle to expand its range 
into formerly climatically unsuitable habitats**'. Future projected 
warming could facilitate further range expansion. Insect outbreaks, 
together with fires, could put North American forest carbon sinks at 
risk’. Disturbances are one of the principal drivers of the carbon 
budget in northern forests”, and this study shows but one example 
of how climate change can affect disturbance regimes for which 
impacts on the global carbon cycle then provide strong positive feed- 
back to the global climate system. 


METHODS SUMMARY 


The CBM-CFS3 uses forest inventory as well as growth and yield data to simulate 
stand- and landscape-level forest carbon dynamics**’. It tracks carbon stocks, 
transfers between pools and emissions. The model accounts for tree growth, 
litterfall, turnover and decay as annual components of the forest carbon cycle. 
Fires were simulated as stand-replacing events where a portion of foliage and 
above-ground dead organic matter pools were consumed; all remaining biomass 
was killed, and standing snags were transferred to litter and coarse woody debris. 
Harvests were also simulated as stand-replacing clear-cut events where stem- 
wood carbon was transferred out of the ecosystem to the forest products sector, 
and the rest of the carbon in biomass pools was transferred to pools of dead 
organic matter. Beetle infestation areas were simulated as partial-mortality 
events killing a portion of softwood biomass pools. The remaining stand con- 
tinues to grow and recover from beetle impacts. The stand mortality classes were 
5%, 10%, 30% or 50% per year (see Supplementary Information for more 
details). 

The British Columbia Ministry of Forests and Range supplied forest inventory 
and growth curve data. Almost 700,000 forest stands were assigned to 86 spatial 
analysis units that represented the administrative and ecological variability 
in the study area. Spatially and temporally explicit beetle outbreak area and 
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area-burned data sets were used for 2000-2006 and projected for 2007-2020 
using random draws from historically calibrated probability distributions. 
Harvest rates for each administrative unit were available for 2000-2005, and 
forecasts for 2006-2020 were based on local management plans (see 
Supplementary Information for more details). 
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The draft genome of the transgenic tropical fruit tree 
papaya (Carica papaya Linnaeus) 
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Papaya, a fruit crop cultivated in tropical and subtropical regions, 
is known for its nutritional benefits and medicinal applications. 
Here we report a 3X draft genome sequence of ‘SunUp’ papaya, 
the first commercial virus-resistant transgenic fruit tree’ to be 
sequenced. The papaya genome is three times the size of the 
Arabidopsis genome, but contains fewer genes, including signifi- 
cantly fewer disease-resistance gene analogues. Comparison of the 
five sequenced genomes suggests a minimal angiosperm gene set 
of 13,311. A lack of recent genome duplication, atypical of other 
angiosperm genomes sequenced so far*°, may account for the 
smaller papaya gene number in most functional groups. None- 
theless, striking amplifications in gene number within particular 
functional groups suggest roles in the evolution of tree-like habit, 
deposition and remobilization of starch reserves, attraction of 
seed dispersal agents, and adaptation to tropical daylengths. 
Transgenesis at three locations is closely associated with chloro- 
plast insertions into the nuclear genome, and with topoisomerase I 
recognition sites. Papaya offers numerous advantages as a system 
for fruit-tree functional genomics, and this draft genome sequence 


provides the foundation for revealing the basis of Carica’s 
distinguishing morpho-physiological, medicinal and nutritional 
properties. 

Papaya is an exceptionally promising system for the exploration of 
tropical-tree genomes and fruit-tree genomics. It has a relatively 
small genome of 372 megabases (Mb)’°, diploid inheritance with nine 
pairs of chromosomes, a well-established transformation system’, 
a short generation time (9-15 months), continuous flowering 
throughout the year and a primitive sex-chromosome system‘*. It is 
a member of the Brassicales, sharing a common ancestor with 
Arabidopsis about 72 million years ago’. Papaya is ranked first on 
nutritional scores among 38 common fruits, based on the percentage 
of the United States Recommended Daily Allowance for vitamin A, 
vitamin C, potassium, folate, niacin, thiamine, riboflavin, iron and 
calcium, plus fibre. Consumption of its fruit is recommended for 
preventing vitamin A deficiency, a cause of childhood blindness in 
tropical and subtropical developing countries. The fruit, stems, leaves 
and roots of papaya are used in a wide range of medical applications, 
including production of papain, a valuable proteolytic enzyme. 
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A total of 2.8 million whole-genome shotgun (WGS) sequencing 
reads were generated from a female plant of transgenic cultivar 
SunUp, which was developed through transformation of Sunset that 
had undergone more than 25 generations of inbreeding’®. The esti- 
mated residual heterozygosity of SunUp is 0.06% (Supplementary 
Note 1). After excluding low-quality and organellar reads, 1.6 million 
high-quality reads were assembled into contigs containing 271 Mb 
and scaffolds spanning 370 Mb including embedded gaps (Supple- 
mentary Tables 1 and 2). Of 16,362 unigenes derived from expressed 
sequence tags (ESTs), 15,064 (92.1%) matched this assembly. Paired- 
end reads from 34,065 bacterial artificial chromosome (BAC) clones 
provided alignment to an fingerprinted contig (FPC)-based physical 
map (Supplementary Note 2). Among 706 BAC end and WGS 
sequence-derived simple sequence repeats on the genetic map, 652 
(92.4%) could be used to anchor 167 Mb of contigs or 235 Mb of 
scaffolds, to the 12 papaya linkage groups in the current genetic map 
(Supplementary Fig. 1). 

Papaya chromosomes at the pachytene stage of meiosis are 
generally stained lightly by 4',6-diamidino-2-phenylindole (DAPI), 
revealing that the papaya genome is largely euchromatic. However, 
highly condensed heterochromatin knobs were observed on most 
chromosomes (Supplementary Fig. 2), concentrated in the centro- 
meric and pericentromeric regions. The lengths of the pachytene 
bivalents that are heavily stained only account for approximately 
17% of the genome. However, these cytologically distinct and highly 
condensed heterochromatic regions could represent 30-35% of the 
genomic DNA". A large portion of the heterochromatic DNA was 
probably not covered by the WGS sequence. The 271 Mb of contig 
sequence should represent about 75% of the papaya genome and 
more than 90% of the euchromatic regions, which is similar to the 
92.1% of the EST and 92.4% of genetic markers covered by the 
assembled genome and the theoretical 95% coverage by 3X WGS 
sequence”. 

Gene annotation was carried out using the TIGR Eukaryotic 
Annotation Pipeline. The assembled genome was masked based on 
similarity to known repeat elements in RepBase and the TIGR Plant 
Repeat Database, plus a de novo papaya repeat database (see 
Methods). Ab initio gene predictions were combined with spliced 
alignments of proteins and transcripts to produce a reference gene 
set of 28,629 gene models (Supplementary Table 3). A total of 21,784 
(76.1%) of the predicted papaya genes with average length of 1,057 
base pairs (bp) have similarity to proteins in the non-redundant 
database from the National Center for Biotechnology Information, 
with 9,760 (44.8%) of these supported by papaya unigenes. Among 
6,845 genes with average length 309 bp that had no hits to the non- 
redundant proteins, only 515 (7.5%) were supported by papaya uni- 
genes, implying that the number of predicted papaya-specific genes 
was inflated. If the 515 genes with unigene support represent 44.8% 
of the total, then 1,150 predicted papaya-specific genes may be real, 
and the number of predicted genes in the assembled papaya genome 
would be 22,934. Considering the assembled genome covers 92.1% of 
the unigenes and 92.4% of the mapped genetic markers, the number 
of predicted genes in the papaya genome could be 7.9% higher, or 
24,746, about 11-20% less than Arabidopsis (based on either the 


Table 1| Statistics of sequenced plant genomes 


Carica Arabidopsis Populus Oryza sativa Vitis 


papaya thaliana trichocarpa (japonica) vinifera 
Size (Mbp) 372 125 485 389 487 
Number of chromosomes 9 5 19 12 19 
G + C content total (%) 35.3 35.0 33.3 43.0 36.2 
Gene number 24,746 31,114* 45555 37,544 30,434 
Average gene length (bp per gene) 2,373 2,232 2,300 2821 3,399 
Average intron length (bp) 479 165 379 412 213 
Transposons (%) 519 14 42 34.8 41.4 


* The gene number of Arabidopsis is based on the 27,873 protein-coding and RNA genes from 
The Arabidopsis Information Resource website (http://www.arabidopsis.org/portals/ 
genAnnotation/genome_snapshot.jsp) and recently published 3,241 novel genes°®. 
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27,873 protein coding and RNA genes, or including the 3,241 novel 
genes)”'*, 34% less than rice’, 46% less than poplar* and 19% less 
than grape’ (Table 1). 

Comparison of the papaya genome with that of Arabidopsis 
sheds new light on angiosperm evolutionary history in several ways. 
Considering only the 200 longest papaya scaffolds, we found 121 co- 
linear blocks. The papaya blocks range in size from 1.36 Mb contain- 
ing 181 genes to 0.16 Mb containing 19 genes (a statistical, rather 
than a biological, lower limit); the corresponding Arabidopsis regions 
range from 0.69 Mb containing 163 genes to 60 kilobases (kb) con- 
taining 18 genes. Across the 121 papaya segments for which co- 
linearity can be detected, 26 show primary correspondence (that is, 
excluding the effects of ancient triplication detailed below) to only 
one Arabidopsis segment, 41 to two, 21 to three, 30 to four, and only 3 
to more than four. 

The fact that many papaya segments show co-linearity with two to 
four Arabidopsis segments (Fig. 1, and Supplementary Figs 3 and 4) is 
most parsimoniously explained if either one or two genome duplica- 
tions have affected the Arabidopsis lineage since its divergence from 
papaya. Although it was suspected that the most recent Arabidopsis 
genome duplication, «"*, might affect only a subset of the 
Brassicales'’, previous phylogenetic dating of these events’’ had sug- 
gested that the more ancient B-duplication occurred early in the 
eudicot radiation, well before the Arabidopsis—Carica divergence. 
This incongruity is under investigation. 

In contrast, individual Arabidopsis gnome segments correspond 
to only one papaya segment, indicating that no genome duplication 
has occurred in the papaya lineage since its divergence from 
Arabidopsis about 72 million years ago’. The lack of relatively recent 
papaya genome doubling is further supported by an L-shaped distri- 
bution of intra-EST correspondence for papaya (not shown). How- 
ever, multiple genome/subgenome alignments (see Supplementary 
Methods) reveal evidence in papaya of the ancient ‘y’ genome 
duplication shared with Arabidopsis and poplar that is postulated 
to have occurred near the origin of angiosperms". Indeed, both 
papaya (with no subsequent duplication) and poplar (with a rela- 
tively low rate of duplicate gene loss) suggest that y was not a duplica- 
tion but a triplication (Fig. 1), with triplicated patterns evident for 
about 25% of the 247 Mb comprising the 200 largest papaya scaffolds. 
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Figure 1| Alignment of co-linear regions from Arabidopsis (green), papaya 
(magenta), poplar (blue) and grape (red). ‘Vv chr16r’ is an unordered 
ultracontig that has been assigned to grape chromosome 16. Triangles 
represent individual genes with transcriptional orientations. Several 
Arabidopsis regions belong to previously identified duplication segments 
(%3, «11, 020, 86, y7, shown to the right)”*. The whole syntenic alignment 
supports four distinct whole-genome duplication events: «, B within the 
Arabidopsis lineage, an independent duplication in poplar, and y which is 
shared by all four eudicot genomes. Co-linear regions can be grouped into 
three y sub-genomes based on Camin—Sokal parsimony criteria. 
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This is most probably an underestimate that will increase as papaya 
contiguity is improved. Triplication in papaya and poplar corre- 
sponds closely to the triplication suggested by an independent ana- 
lysis of the grape genome’. 

A few hundred papaya chromosomal segments were aligned using 
BLASTZ to their one to four syntenic regions in Arabidopsis, and the 
results examined visually using the Genome Evolution (GEvo) 
viewer’. The orthologous region of grape was also included’, making 
the alignment a six-way comparison. One example is given in 
Supplementary Fig. 5: a 500kb segment of papaya, its four 60 kb 
syntenic, orthologous Arabidopsis segments and the 400 kb ortholo- 
gous segment of grape. 

For the homologous Arabidopsis segments that are discernibly co- 
linear (by MC-SCANNER) to the 200 longest papaya scaffolds, 
34.8% of Arabidopsis genes in any one segment correspond to a 
papaya gene, whereas only 24.8% of papaya genes in any one segment 
correspond to an Arabidopsis gene. Moreover, the Arabidopsis homo- 
logous segments contain fewer genes, on average only about 57.9% of 
the number in their papaya counterparts. 

Papaya provides a useful outgroup necessary to detect subfunctio- 
nalization. Supplementary Fig. 6 is a GEvo screenshot of a blastn 
alignment illustrating subfunctionalization of conserved non-coding 
sequences (CNSs)'” upstream of two syntenic, duplicate Arabidopsis 
genes and their single papaya orthologous gene. The o-duplicated 
genomes within Arabidopsis are perfect for CNS discovery’. 

Comparative analysis of the papaya and Arabidopsis 5’ untrans- 
lated regions showed that only 14% of orthologous promoter pairs 
exhibit significantly higher levels of sequence identity than random 
comparisons (Supplementary Figs 7 and 8). Although some highly 
conserved promoters show substantial conservation across much of 
their length, sequence similarity for most orthologous papaya pro- 
moters is indistinguishable from background. 

Global analysis of all inferred protein models from papaya, 
Arabidopsis, poplar, grape and rice clusters the 208,901 non-redundant 
protein sequences into 39,706 similarity groups, or ‘tribes’, 11,851 of 
which contain two or more genes (see Supplementary Methods). Tribes 
with multiple genes in a species typically correspond to families or 
subfamilies of genes; however, tribes may also contain just one gene 
(‘singleton tribes’). In papaya, 25,312 gene models were classified into 
12,958 tribes, 5,669 of which were specific to papaya (Supplementary 
Table 4). Of the papaya-specific tribes, 5,314 were singleton tribes. EST 
support was markedly lower for genes in papaya-specific tribes (below 
14%) than in tribes that included genes from at least one other taxon 
(72.4%). 

To investigate the smaller number of genes in papaya, we com- 
pared tribe membership from each of the five sequenced angiosperm 
species (Supplementary Table 5). Among the 6,726 tribes that con- 
tain genes from both Arabidopsis and papaya, 3,595 contain equal 
numbers of genes from both species. However, tribes with more 
Arabidopsis genes outnumber those with more papaya genes by more 
than 2:1 (2,153:979). The trend of smaller number of papaya genes is 
widespread across tribes of all sizes and major functional categories 
(Supplementary Table 6 and Supplementary Fig. 9). 

We then examined membership in the 815 tribes with members 
identified as being likely transcription factors in the Arabidopsis tran- 
scription factor database (http://arabidopsis.med.ohio-state.edu/ 
AtTFDB/). This set includes 2,897 genes in Arabidopsis and 2,438 
in papaya (a ratio of 1.19:1). The details of tribe membership are 
illustrated for 25 exemplar families and superfamilies (Fig. 2), where 
most transcription-factor tribes have fewer genes in papaya than 
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Figure 2 | Comparison of gene numbers in transcription-factor tribe or 
related tribes from Arabidopsis and papaya. Most transcription factors are 
represented by fewer genes in papaya than Arabidopsis. Transcription-factor 
names are given, with values after the names corresponding to: number of 
tribes with genes assigned to transcription factor group, number of tribes 
with smaller counts in papaya than Arabidopsis, number of tribes with equal 
counts in papaya and Arabidopsis, number of tribes with larger counts in 
papaya, and number of tribes with zero members in papaya. Supporting data 
are provided in Supplementary Table 8. 


Arabidopsis. Some transcription-factor tribes had more genes in 
papaya, specifically RWP-RK, MADS-box, Scarecrow, TCP and 
Jumonji gene families. Interestingly, the difference in MADS protein 
family size appears to be due to expanded numbers for half of the 36 
MADS tribes. The other 18 MADS tribes had fewer papaya genes, 
including 14 that were not found in papaya. 

Assuming that a generalized angiosperm could potentially require 
only the types and minimal numbers of genes that are shared among 
divergent plant species, we examined each of the tribes shared among 
the five angiosperms with sequenced genomes. The number of genes 
required in a minimal flowering plant is based on the observed 
minimum number of genes across each of the shared tribes 
(Table 2). When the smallest observed number is taken for each 
evolutionarily conserved tribe, a minimal angiosperm genome of 
13,311 genes is estimated. Papaya has the smallest number of genes 
for more tribes than any other sequenced taxon (4,515, or 76% of 
5,925 shared tribes), reinforcing the notion that papaya has fewer 
genes than any angiosperm sequenced so far. 

Only 55 nucleotide-binding site (NBS)-containing R genes were 
identified in papaya; about 28% of the 200 NBS genes in Arabidopsis” 
and less than 10% of the 600 NBS genes in rice*’. Resistance proteins 
also have a carboxy-terminal leucine-rich repeat (LRR) domain. 
These NBS-containing R-gene families can be subdivided into three 
classes: NBS—-LRR, toll interleukin receptor (TIR)-NBS-LRR, and 
coiled-coil (CC)—-NBS-LRR on the basis of their amino-terminal 
region. Papaya NBS-LRR outnumbered both TIR-NBS-LRR and 
CC-NBS-LRR genes, in contrast to both poplar (with more CC- 
NBS-LRR genes*) and Arabidopsis (with more TIR-NBS-LRR). 
More than 50% of the NBS-type R genes were clustered in about 
eight scaffolds, indicating that resistance gene evolution may involve 
duplication and divergence of linked gene families. 


Table 2 | Deduced potential minimal angiosperm gene number based on species with smallest number of genes for each tribe 


Carica papaya Arabidopsis thaliana _— Populus trichocarpa Oryza sativa (japonica) Vitis vinifera Shared tribes Minimal gene number 
Shared tribes with minimum 4,515 3,597 1,548 3,657 3,597 5,925 13,331 
Number of unique tribes 5,708 2,950 6,338 13,003 3,567 
Number of conserved tribes lost or 405 113 28 429 175 


missing from each species 
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Homologues for genes involved in cellulose biosynthesis are pre- 
sent in papaya and Arabidopsis, with more cellulose synthase genes in 
poplar, perhaps associated with wood formation. Papaya has at least 
32 putative B-glucosyl transferase (GT1) genes compared with 121 in 
Arabidopsis identified using sequence alignment. A total of 38 and 40 
cellulose synthase-related genes (GT2) were identified in papaya 
using the 48 poplar and 31 Arabidopsis genes as queries, respectively. 
These genes include 11 cellulose synthase (CesA) genes, the same 
number as in Arabidopsis but 7 fewer than in poplar. Putative cel- 
lulose orientation genes (COBRA) were more abundant in 
Arabidopsis (12) than in papaya (8). 

Papaya also has a similar complement though fewer genes for cell- 
wall synthesis than Arabidopsis. Papaya and Arabidopsis, respectively, 
have 6 and 12 callose synthase genes (GT2); 15 and 15 xyloglucan 
o.-1,2-fucosyl transferases (GT37); 5 and 7 B-glucuronic acid trans- 
ferases in familes GT43 and GT47; and 27 and 42 in GT8 that includes 
galacturonosyl transferases, associated with pectin synthesis. 

The cell wall of plants is capable of both plastic and elastic exten- 
sion, and controls the rate and direction of cell expansion”. Despite 
fewer whole-genome duplications, papaya has a similar number of 
putative expansin A genes (24) as Arabidopsis (26) and poplar (27), 
and more expansin B genes (10) than Arabidopsis (6) and poplar (3). 

In contrast to expansion-related genes, papaya has on average 
about 25% fewer cell-wall degradation genes than Arabidopsis, in 
some cases far fewer. For example, papaya and Arabidopsis, respec- 
tively, have 4 and 12 endoxylanase-like genes in glycoside hydrolase 
family 10 (GH10); 29 and 67 pectin methyl esterases (carbohydrate 
esterase family 8); 28 and 69 polygalacturonases (GH28); 15 and 49 
xyloglucan endotransglycosylase/hydrolases (GH16); 18 and 25 
B-1,4-endoglucanases (GH9); 42 and 91 B-1,3-glucanases (GH17); 
and 15 and 27 pectin lyases (PL1). 

A semi-woody giant herb that accumulates lignin in the cell wall at 
an intermediate level between Arabidopsis and poplar, papaya gene- 
rally has intermediate numbers of lignin synthetic genes, fewer than 
poplar but more than Arabidopsis despite fewer opportunities for 
duplication in papaya. Poplar, papaya and Arabidopsis have 37, 30 
and 18 candidate genes for the lignin synthesis pathway, respec- 
tively*”, with papaya having an intermediate number of genes for 
the PAL, C4H, 4CL and HCT gene families, and only one COMT and 
two C3H genes. In contrast, poplar has three C3H genes, which are 
presumed to convert p-coumaroyl quinic acid to caffeoyl shikimic 
acid, whereas there are two in papaya and one in Arabidopsis. Papaya, 
Arabidopsis and poplar each have two genes in the family 
CCoAOMT, which are presumed to convert caffeic acid to ferulic 
acid*. Compared with these other plants, papaya has the fewest genes 
in the CCR gene family (1 gene) and the most in the F5H (4 genes) 
and CAD gene families (18 genes), which all mediate later steps of the 
lignin biosynthesis pathway. 

More starch-associated genes in papaya, a perennial, may be due to 
a greater need for storage in leaves, stem and developing fruit than in 
Arabidopsis, an ephemeral that stores oil in the seed. Papaya and 
Arabidopsis, respectively, have 13 and 6 putative starch synthase 
(GT5) genes; 8 and 3 starch branching genes; 6 and 3 isoamylases 
(GH13); and 12 and 9 B-amylases (GH14). Early unloading of fruit 
sugar in papaya is probably symplastic™, with five genes for sucrose 
synthase/sucrose phosphate synthase (GT4); seven are reported for 
Arabidopsis. Five acid invertase (GH32) sequences were found in 
papaya whereas 11 have been reported in Arabidopsis. Papaya has 
at least seven putative neutral invertase (GH32) genes; Arabidopsis 
has six. Wall-associated kinases (WAK) are thought to be involved in 
the regulation of vacuolar invertases, with 17 in Arabidopsis and only 
10 in papaya. Arabidopsis and papaya have 14 and 7 hexose trans- 
porters, respectively. The greater number of genes for sugar accu- 
mulation in Arabidopsis may reflect recent genome duplications. 

Papaya has undergone particularly striking amplification of genes 
involved in volatile development. Papaya and Arabidopsis, respec- 
tively, have 18 and 8 genes for cinnamyl alcohol dehydrogenase; 2 
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and 1 genes for cinnamate-4-hydroxylase; 9 and 3 genes for pheny- 
lalanine ammonia lyase; and 24 and 3 limonene cyclase genes. 

Papaya ripening is climacteric, with the rise in ethylene production 
occurring at the same time as the respiratory increase*. Papaya and 
Arabidopsis, respectively, have similar numbers of genes involved in 
ethylene synthesis, with four each for S-adenosyl methionine 
synthase (SAM synthase); 8 and 13 for aminocyclopropane carb- 
oxylic acid (ACC) synthase (ACS); 8 and 12 for ACC oxidase 
(ACO); and 42 and 64 for ethylene-responsive binding factors 
(AP2/ERE). 

Because papaya grows in tropical climates where daily light/dark 
cycles do not change much over the year, we can ask if more or fewer 
light/circadian genes are required to synchronize with the environ- 
ment. In fact, there are fewer light/clock genes in the papaya genome 
(49% and 34% of poplar and Arabidopsis, respectively; Supplemen- 
tary Table 7). However, among the core circadian clock genes, the 
pseudo-response regulators (PRRs; Supplementary Fig. 10) have 
expanded in poplar compared with Arabidopsis, and the papaya 
PRR7 cluster has seemingly duplicated with the recent poplar 
salicoid-specific genome duplication* (Supplementary Fig. 11). 
Against the backdrop of fewer overall genes, the parallel expansion 
of the PRRs is consistent with circadian timing being important in 
papaya. 

The PAS-FBOX-KELCH genes control light signalling and 
flowering time; however, the only papaya orthologue (ZTL) lacks 
an obvious KELCH domain compared with Arabidopsis and poplar, 
which have five and one KELCH domains, respectively (Supplemen- 
tary Fig. 10). In fact, the papaya genome contains fewer KELCH 
domains (37 compared with 130 and 74 in Arabidopsis and poplar, 
respectively). In contrast, there are three constitutive photomorpho- 
genic 1 (COP1) paralogues in the papaya genome compared with 
only one in Arabidopsis (Supplementary Tables 7 and 8). A similar 
expansion has been noted in moss (Physcomitrella patens), which has 
nine COP1 paralogues that are hypothesized to aid in tolerance to 
ultraviolet light (Supplementary Fig. 12)*°. Both KELCH domains 
and the WD-40 of the COP1 family form B-propellers and play a role 
in light-mediated ubiquitination. There is not a general expansion of 
WD-40 genes in papaya (173 compared with 227 in Arabidopsis). 
Perhaps papaya has developed an alternative way of integrating light 
or timing information specific to day-neutral plants, such as a strict 
adherence to the diel light/dark cycle that is better served by the COP- 
mediated system. 

Sex determination in papaya is controlled by a pair of primitive 
sex chromosomes, with a small male-specific region of the Y chro- 
mosome (MSY)*. The physical map of the MSY is currently estimated 
by chromosome walking to span about 8 Mb (ref. 27). Two scaffolds 
in the current female-genome sequence align to the X chromosome 
physical map based on BAC end sequences, spanning 4.5 Mb and 
including 254 predicted protein-encoding genes, of which 75 
(29.5%) have EST support (Supplementary Table 9 and Supple- 
mentary Fig. 13). If adjusted for the percentage of unigene validation 
for other genes (48.0%), the estimated number of genes in the 
X-specific region would be 156. The average gene density would be 
one gene per 19.5 kb, lower than the estimated genome average of one 
gene per 14.3kb. By contrast, among seven completely sequenced 
MSY BACs totalling 1.2 Mb, a total of four expressed genes were 
found on two of the BACs'*’*. The somewhat lower-than-average 
gene density in the X-specific scaffolds is accompanied by more 
repetitive DNA (58.3%) than the genome-wide average, perhaps 
because this region is near the centromere”. Re-analysis of 
the repetitive DNA content of the MSY BACs, to include the new 
papaya-specific repeat families identified herein, increased the aver- 
age repeat sequence to 85.6%, with 54.1% Gypsy and 1.9% Copia 
retro-elements (Supplementary Table 10). This compares with an 
earlier estimate of 17.9% using the Arabidopsis repeat database 
alone”*. 
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The SunUp genome has presented an opportunity to analyse trans- 
gene insertion sites critically. Southern blot analysis was key in the 
initial identification of transgenic insertion fragments and was per- 
formed with probes spanning the entire 19,567-bp transformation 
vector used for bombardment (Supplementary Fig. 14). Among the 
identified inserts were the functional coat-protein transgene confer- 
ring resistance to papaya ringspot virus, which was found in an intact 
9,789-bp fragment of the transformation plasmid, and a 1,533-bp 
fragment composed of a truncated, non-functional tetA gene and 
flanking vector backbone sequence. The structures of the coat- 
protein transgene and tetA region insertion sites were determined 
from cloned sequences. Southern analysis also confirmed a 290-bp 
non-functional fragment of the np?II gene originally identified by 
WGS sequence analysis (Supplementary Fig. 15). Five of the six 
flanking sequences of the three insertions are nuclear DNA copies 
of papaya chloroplast DNA fragments. The integration of the trans- 
genes into chloroplast DNA-like sequences may be related to the 
observation that transgenes produced either by Agrobacterium- 
mediated or biolistic transformation are often inserted in AT-rich 
DNA”, as is the chloroplast DNA of papaya and other land plants. 
Four of the six insert junctions have sequences that match topoisome- 
rase I recognition sites, which are associated with breakpoints in geno- 
mic DNA transgene insertion sites and transgene rearrangements”. 
The presence of these inserts was confirmed by high-throughput 
MUMmer”™ analysis for each region of the transformation vector. 
Evidence for the presence of other transgene inserts is not conclusive 
(Supplementary Note 3). 

Its lower overall gene number notwithstanding, striking variations 
in gene number within particular functional groups, superimposed 
on the average approximate 20% reduction in papaya gene number 
relative to Arabidopsis, may be related to key features of papaya 
morphological evolution. Despite a closer evolutionary relationship 
to Arabidopsis, papaya shares with poplar an increased number of 
genes associated with cell expansion, consistent with larger plant size; 
and lignin biosynthesis, consistent with the convergent evolution of 
tree-like habit. Amplification of starch-synthesis genes in papaya 
relative to Arabidopsis is consistent with a greater need for storage 
in leaves, stem and developing fruit of this perennial. Tremendous 
amplification in papaya of genes related to volatile development 
implies strong natural selection for enhanced attractants that may 
be key to fruit (seed) dispersal by animals and which may also have 
attracted the attention of aboriginal peoples. This also foreshadows 
what we might expect to discover in the genomes of other fragrant- 
fruited trees, as well as plants with striking fragrance of leaves (herbs), 
flowers or other organs. 

Arguably, the sequencing of the genome of SunUp papaya makes it 
the best-characterized commercial transgenic crop. Because papaya 
ringspot virus is widespread in nearly all papaya-growing regions, 
SunUp could serve as a transgenic germplasm source that could be 
used to breed suitable cultivars resistant to the virus in various parts 
of the world. The characterization of the precise transgenic modifica- 
tions in SunUp papaya should also serve to lower regulatory barriers 
currently in place in some countries. 


METHODS SUMMARY 

Gene annotation. Papaya unigenes from complementary DNA were aligned to 
the unmasked genome assembly, which was then used in training ab initio gene 
prediction software. Spliced alignments of proteins from the plant division of 
GenBank, and transcripts from related angiosperms, were generated. Gene pre- 
dictions were combined with spliced alignments of proteins and transcripts to 
produce a reference gene set. Detailed descriptions are given in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Genome assembly. The Genome sequence was assembled by Arachne*!. WGS 
reads and BAC end reads were trimmed by LUCY and screened for organellar 
sequences”. Two approaches were applied to screening and removing reads of 
presumably organellar origin to alleviate the load in assembling highly repetitive 
regions by WGS assembly software. The first approach was an iterative process, 
in which reads were assembled, contigs matching with organellar genomes iden- 
tified, constituent reads removed, and the process repeated by two or three more 
rounds. This approach produced the read sets for the released assemblies 
Stripped3 and Stripped4. The second approach was to remove plasmid clones 
and BAC clones of presumably organellar origin by identifying clones with both 
end reads matching entirely with organellar genomes, with physical map 
information an amendment to the identification of BAC clones. Two rounds 
of iterative screening based on pairing information of assembled and unplaced 
reads were added to the second approach to generate the read set for the released 
Papayal.0 assembly. 

The sequence error rates were estimated by aligning assembled shotgun 
sequences with two finished BACs (GenBank accession numbers EF661023 
and EF661026). The error rate of the assembly at 3X coverage or deeper 
(74.2% of assembled sequences) was less than 0.01% based on average quality 
values of 20 or greater in trimmed sequence. The error rate at 2 coverage 
(16.3%) was 0.37%. The error rate at 1X coverage (9.5%) was approximately 
0.75%, because these sequences are at the ends of the contigs (and sequence 
reads) where the sequence quality declined. 

Genome annotation. Gene annotation was conducted following the TIGR 
Eukaryotic Annotation Pipeline. Repeat sequences were identified in the 
assembled genome and masked by RepeatMasker, RepeatScout and 
TransposonPSI, based on known repeat elements in RepBase databases and 
TIGR Plant Repeat Databases, and the papaya novel repeat database constructed 
in this study****. Program to Assemble Spliced Alignments (PASA)** was used to 
generate spliced alignments of papaya unigenes to the unmasked assembly, which 
was then used in training ab initio gene prediction software Augustus, 
GlimmerHMM and SNAP****. Ab initio gene prediction software Fgenesh, 
Genscan and TWINSCAN were trained on Arabidopsis”. Spliced alignments 
of proteins from the plant division of GenBank and transcripts from related 
angiosperms (Arabidopsis thaliana, Glycine max, Gossypium hirsutum, Medicago 
truncatula, Nicotiana tabacum, Oryza sativa, Zea mays) were generated by the 
Analysis and Annotation Tool (AAT)**. Spliced alignment of proteins from the 
Pfam database were generated using GeneWise***. Gene predictions generated by 
Augustus, Fgenesh, Genscan, GlimmerHMM, SNAP and TWINSCAN were com- 
bined with spliced alignments of proteins and transcripts to produce a reference 
gene set using the evidence-based combiner EVidenceModeler (EVM)*°. Protein 
domains were predicted using InterProScan against protein databases (PRINTS, 
Pfam, ProDom, PROSITE, SMART)**. 

Construction of papaya repeat database. We used a combination of homology- 
based and de novo methods to identify signatures of transposable elements in the 
papaya genome. We used RepeatMasker (http://www.repeatmasker.org) in 
combination with a custom-built library of plant repeat elements for our initial 
classification of transposable elements. The customized library was generated by 
combining plant repeats from Repbase and plant repeat databases from TIGR 
(ftp://ftp.tigr.org/pub/data/TIGR_Plant_Repeats)**. Repeat elements identified 
as ribosomal RNA sequences in the TIGR databases match a large fraction of the 
papaya genome (about 3%). Ribosomal RNAs were identified separately, and 
therefore were excluded from our repeat library, leaving a database of 76,924 
repeat sequences that were used to search the papaya genome. 

Homology-based methods are limited to finding elements that have not 
diverged too greatly from known repeats. Because databases of known transpos- 
able elements are necessarily incomplete, we used additional de novo methods to 
search for repeat elements in papaya contigs. For this, we applied two recently 
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developed repeat-finding tools, PILER and RepeatScout to the complete set of 
contigs from the papaya genome**”'. PILER was able to find 428 repeat families 
whereas RepeatScout found 6,596 repeat sequences. 

The repeat families obtained from PILER and RepeatScout were annotated 
using a combination of manual curation (786 repeat families) and automated 
analysis. For the automated annotation, the combined data set from PILER and 
RepeatScout was made non-redundant (using CD-HIT at the 90% similarity 
level), leaving behind 6,240 repeat families’. As a post-processing step, we 
selected only those families that had at least ten good (E value <1 X 107°) 
BLAST matches to papaya contigs. The resulting data set contained 2,198 repeat 
families in the papaya genome. BLAST searches against non-redundant and 
PTREP (http://wheat.pw.usda.gov/ITMI/Repeats) were then used to identify 
repeat families matching genes associated with transposons and retrotranspo- 
sons. This procedure discovered an additional 103 repeat families that could be 
annotated as being retrotransposons. The combined database of 889 annotated 
papaya-specific transposable-element sequences was used in addition to the 
database of known repeats to annotate the papaya genome. The remaining, 
unannotated repeat families (1,455 sequences with no matches to known genes) 
were then used to estimate the additional repeat content of the genome. 
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Genetic variation in human NPY expression affects 
stress response and emotion 
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Matti Virkkunen°, Deborah C. Mash°, Robert H. Lipsky’, Xian-Zhang Hu’, Colin A. Hodgkinson’, Ke Xu’, 
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Understanding inter-individual differences in stress response 
requires the explanation of genetic influences at multiple pheno- 
typic levels, including complex behaviours and the metabolic res- 
ponses of brain regions to emotional stimuli. Neuropeptide Y 
(NPY) is anxiolytic’’ and its release is induced by stress*. NPY is 
abundantly expressed in regions of the limbic system that are 
implicated in arousal and in the assignment of emotional valences 
to stimuli and memories**. Here we show that haplotype-driven 
NPY expression predicts brain responses to emotional and stress 
challenges and also inversely correlates with trait anxiety. NPY 
haplotypes predicted levels of NPY messenger RNA in post- 
mortem brain and lymphoblasts, and levels of plasma NPY. 
Lower haplotype-driven NPY expression predicted higher 
emotion-induced activation of the amygdala, as well as diminished 
resiliency as assessed by pain/stress-induced activations of endo- 
genous opioid neurotransmission in various brain regions. A 
single nucleotide polymorphism (SNP rs16147) located in the pro- 
moter region alters NPY expression in vitro and seems to account 
for more than half of the variation in expression in vivo. These 
convergent findings are consistent with the function of NPY as an 
anxiolytic peptide and help to explain inter-individual variation in 
resiliency to stress, a risk factor for many diseases. 

Variation in stress resiliency influences many human characteris- 
tics, including both normal and_ pathological behaviour’. 
Maladaptive responses to stress are critical in the development of 
many psychiatric disorders, including mood and anxiety disorders*”’. 
Anxiety and emotionality (neuroticism) are moderately to highly 
heritable traits (40-60%) but are also strongly influenced by expo- 
sures to stress in a pattern consistent with gene—environment inter- 
action’®. These observations point to the importance of genes that 
modulate the effects of stress. Genes—such as the serotonin trans- 
porter—that have so far been implicated in emotional responses have 
small effects on complex behavioural traits'’, but larger effects on the 
metabolic responses of the brain to emotional stimuli accessed by 
brain imaging'*""*. 

We evaluated effects of neuropeptide NPY on emotion and stress 
resiliency using a haplotype-based approach intended to capture 
effects of unknown loci or locus combinations. We analysed func- 
tionally grouped NPY haplotypes against a complex behaviour, trait 
anxiety, and also on intermediate phenotypes accessed by two differ- 
ent brain imaging modalities in which gene effects might be more 


strongly manifested. An NPY seven-marker panel (Supplementary 
Fig. la) genotyped in 516 Finnish Caucasians captured the major 
haplotypes and linkage disequilibrium features observed in the 
International HapMap Project (http://www.hapmap.org). A block 
of strong pairwise linkage disequilibrium encompasses 70% of the 
gene and extends from the 5’ region to exon 3 (Supplementary Fig. 
1b). Five haplotypes (H1—H5) account for 93.8% of chromosomes in 
this block (Fig. la). 

We observed haplotype-driven NPY mRNA expression in post- 
mortem brain (US Caucasians, Miami sample) by detecting the dif- 
ferential expression of alleles at single nucleotide polymorphism 
(SNP) rs5574 C/T, selected because of its high frequency and location 
in the transcript. Of these 28 samples, chosen because all were 
heterozygous for rs5574, 16 (57%) showed differential allele expres- 
sion at an allele ratio of more than 1.2, in either direction. Hl and H4 
were low-expression haplotypes, H2 was high, H3 was intermediate 
and H5 was unclassified because only two H1/H5 heterozygous brains 
were available (Fig. 1b). This expression-based functional classifica- 
tion is consistent with a cladistically based clustering of haplotypes, 
indicating that expression variation is linked to gene ancestry (Fig. la). 
The effects on expression of the more common H1, H2 and H3 
haplotypes were verified in 47 lymphoblastoid cell lines derived from 
healthy Finnish men (Fig. 1c) representing the six common diplotypes 
(72% of all diplotypes). On the basis of lymphoblast NPY mRNA 
levels, the expression value for each haplotype was calculated by 
regression analysis. Expression values for the six common diplotypes 
were well predicted under a co-dominant model and had a threefold 
range (see Supplementary Fig. 2 for details). Diplotypes were clustered 
into three expression groups: low (LL: H1/H1), intermediate (LH: H1/ 
H3, H3/H3 and H1/H2) and high (HH: H2/H3 and H2/H2) (Fig. Ic). 
We applied this grouping in subsequent analyses. Two loci, rs3037354 
and rs16147, which differentiate the three common haplotypes (see 
Fig. 1a), correlated with expression (Supplementary Fig. 3). However, 
NPY haplotypes accounted for more variation. Finally, haplotype- 
predicted NPY expression was correlated with plasma NPY peptide 
levels available in a US sample (New Haven) in both controls (n= 24) 
and alcoholic patients (7 = 18) sampled during a no-stress condition. 
Individuals with the low-expression LL diplotype had lower NPY 
levels than those with high-expression HH diplotypes; individuals 
with LH diplotypes were intermediate (P< 0.0001 in controls and 
P= 0.0074 in alcoholic patients; Fig. 1d). 
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Figure 1| Haplotype-predicted NPY expression in brain, lymphoblasts and 
plasma. a, Configuration, frequencies (right) and cladistic clustering (left) 
of major NPY haplotypes (H1, H2, H3, H4 and H5; frequency more than 
1%) in Finnish Caucasians (n = 516). b, Differential allele expression of NPY 
mRNA showing allelic ratio (logs(C/T)) of rs5574 in 28 heterozygous human 
post-mortem cerebella (Miami sample) grouped according to diplotypes 
(columns 2-5, means + s.d.) to infer expression levels of H2, H3, H4 and H5 
relative to H1 (the T allele is found only within H1, as shown in 

a). Diplotypes are compared using a two-tailed t-test. Column 1 presents all 
samples; column 6 shows controls for allelic amplification efficiency. 

c, Expression levels of the six common diplotypes in 47 lymphoblastoid cell 


These common, functionally significant NPY haplotypes were 
evaluated for their effect on brain responses to emotion and stress. 
Amygdala activation in response to threat-related facial expressions 
and other provocative stimuli predicts affective arousal, including 
anxiety responses'*’®. We employed a widely used functional mag- 
netic resonance (fMRI) probe’”"” to assess whether NPY diplotypes 
predicted amygdala reactivity to threat-related facial expressions in 
71 healthy volunteers (Pittsburgh sample). This model has been used 
to identify greater amygdala reactivity in individuals possessing the 
lower-transcribing allele of the serotonin-transporter-linked poly- 
morphic region (5-HTTLPR)'”'’. As shown in Fig. 2, amygdala 
activation in individuals with the low-NPY-expression (LL) diplo- 
type was higher than in those with the high-expression (HH) diplo- 
type (P = 0.003). NPY diplotype predicted amygdala reactivity in an 
allele-dosage fashion, and it accounted for 9% of the variance in the 
fMRI amygdala response to emotional challenge. Task-related 
hippocampal activation was similarly predicted in an allele-dosage 
fashion (P = 0.006; Fig. 2). Functional interactions of the amygdala 
and hippocampus are crucial for emotional memories, and long- 
lasting changes in hippocampal architecture are induced by stress’. 

Wealso tested the ability of NPY haplotypes in a model of physical 
and emotional stress involving moderate levels of sustained muscular 
pain. This physical and emotional stress activates endogenous opioid 
neurotransmission in regions of the brain that regulate pain, stress 
and emotion”'. Endogenous opioid release suppresses pain, stress 
and anxiety-like responses in animal models*”’ and humans”. The 
behavioural effects of NPY are mediated, at least in part, through 
interactions with the endogenous opioid system**”*. We measured 
endogenous opioid release by decreases in the availability of 1-opioid 
receptors in vivo during the painful stressor, quantified by means of 


998 


lines derived from healthy Finnish Caucasians. Expression values of the 
three major haplotypes (H1, H2 and H3) were calculated from observed 
diplotype values (points and error bars show means = s.e.m.) by multiple 
regression and were used to predict diplotype values (bars) under a co- 
dominant model (see Supplementary Fig. 2 for details, including fit to 
model). Diplotypes were grouped as low (LL), intermediate (LH) or high 
(HH) in expression. Numbers above columns are n. d, Diplotype-predicted 
plasma NPY levels (means + s.e.m., standardized with Z scores; New Haven 
sample) in controls (black diamonds; LL, n = 16; LH, n = 113; HH, n = 39) 
and alcoholic patients (grey squares; LL, n = 29; LH, n = 72; HH, n = 27). P 
values were calculated by regression analysis. 


positron emission tomography (PET) with the selective L1-opioid 
receptor radiotracer [''C]carfentanil?*?! (see Supplementary 
Methods for details). In 35 healthy volunteers (Ann Arbor sample), 
we found that highly expressed NPY diplotypes predicted signifi- 
cantly higher levels of stress-induced 1-opioid system activation in 
several brain regions (Fig. 3 and Supplementary Table 1) including 
prefrontal cortex, posterior insula, medial and lateral thalamus, vent- 
ral basal ganglia (ventral caudate, ventral putamen and nucleus 
accumbens) and amygdala (analysis of variance (ANOVA), 
P<0.05 after correction for multiple comparisons). NPY diplotype 
accounted for 13% of the variance in activation of [1-opioid neuro- 
transmission in the amygdala, 18-35% of the variance in prefrontal 
cortex, thalamus and nucleus accumbens, and 37% of the variance in 
posterior insular cortex. In comparison with its effects on the activa- 
tion of endogenous opioid neurotransmission by painful stress, NPY 
explained less of the variance in the more complex, self-rated pain 
and affective response phenotypes. NPY diplotypes accounted for 3% 
of the variance in subjective pain (McGill Pain Questionnaire sensory 
subscale) and 5% of the variance in emotional experience (Positive 
and Negative Affectivity Scale negative affect) (Supplementary Fig. 
4). 

In comparison with gene-influenced brain imaging responses in 
which allele action has been evident even in small data sets'*""*, trait 
anxiety is a complex behaviour for which gene effects are small"’, and 
it is the type of gene-influenced behaviour that is perturbed by 
external factors such as exposure to stress'°**. However, it is import- 
ant to understand the role of NPY in complex behaviours and in 
different contexts. In a relatively modest sample of 137 healthy 
Finnish Caucasian controls, expression predicted by NPY diplotype 
was inversely correlated with trait anxiety (Fig. 4a), measured with 
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Figure 2 | Effect of diplotype-predicted NPY mRNA expression on fMRI- 
measured amygdala and hippocampal activation in response to threat- 
related facial expressions. Top: statistical parametric maps representing 
NPY diplotype-biased (LL > LH > HH) mean right dorsal amygdala 

(x = 16, y= —8,z= —14, 64 voxels, t = 2.82, P = 0.003) and right 
hippocampal (x = 24, y 20, z 12, 132 voxels, t = 2.56, P = 0.006) 
activation is shown overlaid on an average sagittal and coronal MRI. Bottom: 
right amygdala (black diamonds) and hippocampal (grey squares) activities 
(means and s.e.m.) from clusters grouped by NPY diplotypes. 
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the Tridimensional Personality Questionnaire (TPQ) Harm 
Avoidance subscales HA1 (Fear of Uncertainty; P= 0.035) and 
HA2 (Anticipatory Worry; P = 0.031). There was no correlation with 
the HA3 (Shyness with Strangers) or HA4 (Fatigability and Asthenia) 
subscales. Diplotype-predicted NPY mRNA expression was lower in 
the small number of Finnish participants with clinical anxiety dis- 
orders (= 18) diagnosed with the Structured Clinical Interview for 
the Diagnostic and Statistical Manual of Mental Disorders, 3rd edi- 
tion revised (SCID), in comparison with the same 137 healthy 
Finnish controls (Fig. 4b). Within Finnish participants with SCID- 
diagnosed alcoholism (m= 138), drug addiction (m= 38), anxiety 
disorders (1 = 18) and major depression (n = 22), we observed no 
correlation of NPY-diplotype-predicted expression with any of the 
HA subscale scores. This may indicate dysregulation of the stress axis 
in these patients; in addition, the sample sizes were not large. None of 
the seven individual markers (Supplementary Fig. 1a) were assoc- 
iated with HA subscales. In contrast with the larger effects of NPY 
diplotype on brain functional responses, the effects of NPY on trait 
anxiety were modest, accounting for 3.3% of the variance in HAI and 
3.4% of the variance in HA2. Very large study samples are required 
for the consistent detection of gene effects on a complex behaviour 
such as trait anxiety. We have presented converging results in which 
modest effects of genes at the level of complex behaviour are sup- 
ported, and mechanistically clarified, by larger effects on brain- 
imaging phenotypes reflecting response to emotion and stress. 
Population stratification, a potential confounding factor, was 
addressed with the use of ancestry informative markers (AIMs). As 
detailed (Supplementary Fig. 5), 186 highly informative AIMs were 
genotyped; this process was followed by a factor analysis anchored 
against a panel of 1,017 Centre d’Etude du Polymorphisme Humain 
(CEPH) worldwide diversity samples representing 52 populations, 
and yielding ethnic factor scores for each individual. These 186 AIMs 
yielded a similar seven-factor solution to that observed” on the basis 
of short tandem repeat markers genotyped in the same populations. 
Our analyses of PET pain/stress response, plasma NPY levels and 
TPQ Harm Avoidance subscales were not confounded by ethnicity, 
as revealed by comparisons of individuals above and below medians. 
For the emotional {MRI imaging sample we had available a set of 15 


Figure 3 | Effect of diplotype-predicted NPY 
mRNA expression on pain/stress-induced 
p-opioid system activation. Activation 

(means = s.e.m.) measured by the percentage 
change in the 1-opioid receptor binding potential 
(BP) among the NPY diplotypes (LL, n = 8; LH, 
n = 21; HH, n = 6) are shown in the dorsomedial 
prefrontal (dmPFC) and posterior insular (pINS) 
cortices, ventrolateral thalamus (vITHA), ventral 
putamen (PUT) and caudate (CAU) nuclei, 
nucleus accumbens (NAC) and ventral amygdala 
(AMY; Supplementary Table 1). Z scores 
represented by the pseudocolour scale (left) are 
superimposed on an anatomically standardized 
MRI image. See also Supplementary Table 1 for 
details of the localization and statistics. 
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Figure 4 | Correlations of diplotype-predicted NPY mRNA expression with 
TPQ trait anxiety and anxiety disorders in Finnish Caucasians. a, Bivariate 
regression of HAI and HA2 scores (means and s.e.m.) and diplotype- 
predicted expression values in 137 healthy subjects (left to right: H1/H1, 
n = 33; H1/H3, n = 35; H3/H3, n = 5; H1/H2, n = 39; H2/H3, n = 15; H2/ 
H2, n = 10). b, Diplotype-predicted NPY mRNA levels (means and s.e.m.) of 
n = 137 healthy controls and n = 18 anxiety patients by a two-tailed t-test 
(means = s.e.m. for HA1 and HA2 are also shown). At the top right is a 
histogram comparing the numbers of controls and anxiety patients within 
each of the six diplotypes. 


AIMs, again with no difference between low and high responders 
(data not shown). 

We identified a locus accounting for part of the NPY haplotype 
effect by testing the four moderately common variants found in the 
NPY promoter region (—1016 base pairs (bp) to 63bp; see 
Supplementary Fig. 6a for details) for their ability to influence 
mRNA expression. The five naturally occurring allele combinations 
were inserted into a promoterless reporter gene vector (pDsRed2-1) 
and NPY promoter haplotype-driven expression was analysed by 
transient transfection into the raphe neuronal cell line RN46A. As 
shown in Supplementary Fig. 6b and Supplementary Table 2, the 
—399C allele (rs16147) accounted for a 30% decrease in basal 
expression determined by comparing the allelic variations between 
the promoter haplotypes. In addition, the TGins allele located at 
—883 bp (183037354) may decrease expression, although to an extent 
not reaching statistical significance. The combination of —399C and 
—883TGins reduced expression 47%. These results are consistent 
with the effect of —399C and the smaller effect of —883TGins on 
mRNA expression in post-mortem brain and lymphoblastoid cell 
lines, and with the greater predictive value of haplotypes both in vitro 
and in vivo (Fig. 1b, c, and Supplementary Fig. 3), and underlies our 
choice to emphasize haplotype effects of NPY. 

Haplotype-based association analysis maximizes the ability to cap- 
ture information, haplotypes serving as proxies for unknown alleles. 
However, the existence of multiple haplotypes can lead to a loss of 
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analytic power without some mode of clustering, for example on a 
cladistic basis**. In this study we functionally grouped most NPY 
haplotypes and diplotypes according to levels of expression in vivo. 
There were two less common haplotypes (H4, with a frequency of 
4.3%, and H5, with a frequency of 4.6%) that were excluded from 
association analyses because of a lack of definitive in vivo expression 
data. However, on the basis of in vitro data for both (Supplementary 
Fig. 6b), and limited brain expression data for H4 (Fig. 1b), it is likely 
that H4 is a low-expression haplotype and H5 is a high-expression 
haplotype. In addition, the H5 haplotype uniquely contains the mis- 
sense variant Leu 7—Pro (rs16139C). Several studies have associated 
Pro 7 with disordered glucose and lipid metabolism”””° but not with 
anxiety, and Pro7 was not associated with anxiety in our data set 
(data not shown). 

We observed effects of haplotype-predicted NPY expression on 
human trait anxiety and on neurobiological circuits and neurotrans- 
mitter systems implicated in the regulation of emotional and stress 
responses. Although the effect of haplotype-predicted NPY expres- 
sion was modest for trait anxiety, consistent effects were observed 
across related measures and were more evident for brain metabolic 
responses to emotional images as measured with {MRI and molecular 
imaging measures of the activation of the endogenous opioid system 
after a stressful challenge. These findings indicate the important role 
of NPY in modulating inter-individual variation in emotion and 
stress resiliency, and reflect the value of a multilevel approach to 
the genetic analysis of behaviour. 


METHODS SUMMARY 


Participants from five independent samples were analysed. A sample of 516 
Finnish Caucasians was used to identify NPY haplotype configuration and link- 
age disequilibrium structure, and subsamples were analysed for lymphoblast 
NPY mRNA, trait anxiety, and anxiety disorders. The New Haven sample 
(n= 42) was measured for plasma NPY. The Pittsburgh sample (n= 71) 
received {MRI after an emotional probe. The Ann Arbor sample (n = 35) under- 
went [''C]carfentanil PET during a pain/stress challenge. A collection of post- 
mortem brains from the Miami sample was genotyped and 28 heterozygous 
cerebella were used for differential allele expression. A full description of 
participants’ demographic data and psychiatric diagnoses is provided in 
Supplementary Methods. Informed consent was obtained under the auspices 
of human research protocols approved by institutional review boards of the 
National Institutes of Health, the University of Helsinki, the University of 
Pittsburgh, the University of Michigan, and Yale University. NPY genotyping 
was performed with 5’-nuclease assays. Genotyping with 186 AIMs was per- 
formed with an Illumina 1536-SNP array. NPY mRNA quantification in lym- 
phoblastoid cell lines was performed with reverse transcription (RT) followed by 
TaqMan real-time PCR. Detection of allele-specific NPY expression in post- 
mortem brain was performed with an RT-coupled 5’ nuclease assay. Plasma 
concentrations of NPY were measured by radioimmunoassay. DNAs containing 
NPY promoter haplotypes were inserted into a reporter vector, pDsRed2-1, and 
transfected to the RN46A cell line for the analysis of promoter variants. Details of 
the {MRI emotional model, the blood-oxygenation-level-dependent (BOLD) 
fMRI acquisition parameters, the experimental design for pain stressor chal- 
lenge, the PET scan processes, image reconstruction, and the data analysis are 
provided in Supplementary Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Genotyping. NPY genotyping was carried out using 5’-nuclease assays for six 
SNP markers (rs17149106, rs16147, rs16139, 1s5573, rs5574 and rs16475) in 
accordance with the standard TaqMan Assay protocol (ABI). A dinucleotide 
insertion/deletion polymorphism (rs3037354) was genotyped by sequencing 
because of the presence of an adjacent SNP. Results were analysed on an ABI 
7900 Sequence Detector, using Sequence Detection System 2.0 software. 
Genotyping accuracy was verified by genotyping in a replicate 10% of samples, 
randomly selected. Genotyping accuracy was greater than 0.99. Genotype com- 
pletion rate was greater than 0.96. Sequencing was performed with BigDye 
Terminator v3.1 reagent and analysed on an ABI 3100 sequencer. The Finnish 
Caucasian sample, the PET imaging pain stress response sample (Ann Arbor) 
and the sample measured for plasma NPY levels (New Haven) were also geno- 
typed with AIMs with the use of a GoldenGate Assay (Illumina) 1,536-marker 
array. The marker selection was based on the HapMap Project reference allele 
frequency (RAF) among European Caucasians, Asians, and Africans (Yoruban). 
Selected markers have an RAF difference greater than 0.7 and RAF ratio larger 
than 10:1 between at least one pair of the continental populations and is balanced 
to distinguish between continental populations”’. 

NPY mRNA quantification by real-time PCR. Total RNA was isolated from 
lymphoblastoid cell lines derived from healthy Finnish men with TRIzol reagents 
(Invitrogen). RNA samples were treated with RNase-free DNase (Ambion) to 
remove contaminating DNA. NPY mRNA quantification was performed with a 
two-step TaqMan real-time RT-PCR assay. Total RNA (1 [ig) was reverse-tran- 
scribed into cDNA with a Cloned AMV First-Strand cDNA Synthesis Kit and 
random hexamers as primers, following the manufacturer’s instructions. CDNA 
(3 pl) was applied in 25 tl of a TaqMan real-time PCR reaction on an ABI 7700 
Sequence Detector with the ABI Assay-on-demand gene expression kit and using 
B-actin as an endogenous control for normalization. Signals were analysed with 
Sequence Detector System software 2.0. Relative NPY mRNA expression was 
calculated by the formula 2—44G, where AC, is the difference in C, (PCR cycle 
number when signal has reached the threshold) between NPY and f-actin. A 
triplicate of each sample was used to determine each measurement and the assay 
was repeated at least three times. 

Detection of allele-specific expression of human NPY in cerebellum. 
Differential allelic expression analysis was performed with a RT-coupled 5’ 
nuclease assay. A coding SNP, rs5574 (C/T) located in NPY exon 3, was selected 
as an endogenous reporter for the assay (see Supplementary Table 3 for primer/ 
probe sequences). Total RNA (1 pg) extracted from post-mortem cerebella of 
individuals heterozygous at rs5574 was reverse-transcribed. cDNA (2 ll) was 
used in a 25-ul reaction for TaqMan real-time PCR. For the control of amp- 
lification efficiency, a template with an artificial 1:1 allelic ratio, constructed by 
inserting one copy each of the cDNA amplicon containing either the C allele and 
T allele in tandem into the pDsRed2-1 plasmid, was also assayed on the same 
plate. The principle and the method of computation of allele expression ratios 
were described previously”. 
Quantification of plasma NPY. Blood samples were obtained at 08:45 (45 min 
after placement of the intravenous catheter) and every 15 min thereafter for a 
total of eight time points under the neutral condition. Plasma concentrations of 
NPY were measured by using our previously well-characterized double-antibody 
radioimmunoassay with '*°I-NPY as the tracer”. The working range of the NPY 
assay is 19.5-1,250 pg ml |, and the assay sensitivity is about 15 pg ml |. The 
intra-assay and inter-assay coefficients of variation are about 4% and 14%, 
respectively. NPY concentrations were standardized with the Z score to mini- 
mize between-assay variation. 
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Report gene constructs and transfection assays. Human NPY promoter and 
upstream regulatory region (from — 1016 bp to 63 bp) were amplified by PCR in 
a 50-1 volume from genomic DNA of individuals who were either homozygous 
or heterozygous for the five common haplotypes of the region. The PCR frag- 
ments were digested with both XhoI and HindIII restriction enzymes, and ligated 
into the multiple cloning sites upstream of the DsRed protein reporter gene of 
the pDsRed2-1 plasmid (BD Biosciences). Ligation products were used to trans- 
form 50pl of TOP10 competent cells (Invitrogen). Plasmid DNAs were 
extracted, and clones with an insert of the correct size were verified by DNA 
sequencing. For transfection analysis, RN46A cells were grown in 50% DMEM 
medium and 50% Ham’s F-12 medium supplemented with 10% FBS. Cells were 
plated out 24h before transfection at a density of 5 X 10° cells per well in 24-well 
plates, and each group of samples was plated in triplicate wells. The constructed 
NPY promoter haplotype-pDsRed plasmid DNA (1.6 jig) and green fluorescent 
protein (GFP) plasmid DNA (0.05 11g), used as a transfection efficiency control, 
were co-transfected into the cultured cells with Lipofectamine 2000 (Invitrogen) 
in accordance with the manufacturer’s instructions. pDsRed vector DNA with- 
out a NPY promoter insert was also used for each transfection as a negative 
control. Both red (DsRed protein reporter gene) and green (GFP) fluorescent 
signals were captured through a fluorescent microscope with the use of OpenLab 
software (Improvision), 48 h after transfection. For each well of the transfected 
cells, ten fields of signals were captured across the same centre areas of the well, 
and a total of 30 fields of both red and green signal were captured for each group 
of cells. For each field red and green fluorescent signals were measured in the 
same region of interest (ROI) with the red signal of each field normalized against 
the green signal. The transfection analysis was repeated at least three times. The 
mean and standard deviation of reporter gene signal were calculated for each 
promoter haplotype group, and the expression levels were compared between 
haplotypes by using a t-test (two-tailed). 

Data analysis. Genotype distributions were tested at each polymorphic locus for 
departure from Hardy—Weinberg equilibrium. Pairwise linkage disequilibrium 
coefficients (D') were estimated and linkage disequilibrium block structures 
were evaluated with Haploview™. A minimum average D’ value of 0.80 was used 
to define block boundaries (however, the average D’ in most blocks was sub- 
stantially higher). Maximum-likelihood estimates of haplotypes and frequencies 
were determined with PHASE v2.02 (ref. 35), and diplotypes were assigned to 
each individual. For haplotype cladistic clustering, the Manhattan distances 
among haplotypes were calculated with a modified metric in which more weight 
was given to markers with higher minor allele frequency and to markers with 
higher 7° values to neighbouring markers (Q.Y., unpublished observations). 
Continuous variables were compared among diplotype groups by a two-tailed 
t-test, analysis of variance, and regression analysis, and are presented as 
means + s.e.m. 
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Insect olfactory receptors are heteromeric 


ligand-gated ion channels 


Koji Sato’, Maurizio Pellegrino”, Takao Nakagawa't, Tatsuro Nakagawa’, Leslie B. Vosshall? & Kazushige Touhara’ 


In insects, each olfactory sensory neuron expresses between one 
and three ligand-binding members of the olfactory receptor (OR) 
gene family, along with the highly conserved and_ broadly 
expressed Or83b co-receptor’’. The functional insect OR consists 
of a heteromeric complex of unknown stoichiometry but compris- 
ing at least one variable odorant-binding subunit and one constant 
Or83b family subunit’®’’. Insect ORs lack homology to G-protein- 
coupled chemosensory receptors in vertebrates’’ and possess a 
distinct seven-transmembrane topology with the amino terminus 
located intracellularly'’®'*. Here we provide evidence that hetero- 
meric insect ORs comprise a new class of ligand-activated non- 
selective cation channels. Heterologous cells expressing silkmoth, 
fruitfly or mosquito heteromeric OR complexes showed extracel- 
lular Ca** influx and cation-non-selective ion conductance on 
stimulation with odorant. Odour-evoked OR currents are inde- 
pendent of known G-protein-coupled second messenger path- 
ways. The fast response kinetics and OR-subunit-dependent K* 
ion selectivity of the insect OR complex support the hypothesis 
that the complex between OR and Or83b itself confers channel 
activity. Direct evidence for odorant-gated channels was obtained 
by outside-out patch-clamp recording of Xenopus oocyte and 
HEK293T cell membranes expressing insect OR complexes. The 
ligand-gated ion channel formed by an insect OR complex seems 
to be the basis for a unique strategy that insects have acquired to 
respond to the olfactory environment. 

To examine the mechanism of signal transduction of multimeric 
OR + Or83b complexes, we expressed ORs with known ligand 
specificities from silkmoth (Bombyx mori), fruitfly (Drosophila mela- 
nogaster) and the malaria vector mosquito (Anopheles gambiae), 
together with members of the Or83b co-receptor family from each 
species (BmOr-2, Or83b and GPROR7, respectively) in heterologous 
cells and performed Ca** imaging and electrophysiology experi- 
ments. HeLa cells expressing Drosophila Or47a + Or83b or 
Anopheles GPROR2 + GPROR7 showed ligand-dependent and 
dose-dependent”"?"” increases in intracellular Ca** that required 
the Or83b-class co-receptor’’’’ (Fig. 1a, b, and Supplementary 
Fig. 1). Inward current responses to pentyl acetate (PA) or 2-methyl 
phenol (2-MP) were observed by whole-cell patch-clamp recordings 
of HeLa cells expressing Or47a + Or83b or GPROR2 + GPROR7, 
respectively, at a holding potential of —60 mV (Fig. Ic, d). The cur- 
rent showed a slight outward rectification similar to that previously 
observed for Bombyx BmOr-1 + BmOr-2 expressed in Xenopus 
oocytes’’. Reversal potentials of —1.9+0.8 and +0.2+1.1mV 
(means + s.e.m.) for Or47a+ Or83b and GPROR2 + GPROR7, 
respectively, were measured (Fig. le-h). Waveforms of outward 
and inward currents were completely symmetrical (Fig. le, f), sug- 
gesting that odorants activate a non-selective cation conductance 
through the insect OR complex. 


We next characterized the source of increase in intracellular Ca” * 
when insect ORs are stimulated by odour ligands (Fig. 2). Chelating 
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Figure 1| Odorants activate a non-selective cation conductance in HeLa 
cells expressing multimeric insect ORs. a, c, e, g, Recording from a HeLa cell 
expressing Or47a, Or83b and «-adrenergic receptor («%)-AR) on stimulation 
with 100 UM pentyl acetate (PA), ethyl acetate (EA), 2-heptanone (2-Hep), 
eugenol (EG) and 100 nM noradrenaline (NA). b, d, f, h, Recording from a 
HeLa cell expressing GPROR2, GPROR7 and «-AR on stimulation with 
100 LM 2-methyl phenol (2-MP), 3-methyl phenol (3-MP), 4-methyl phenol 
(4-MP), EG, benzaldehyde (BA) and 100 nM NA.a, b, Fura-2 acetoxymethyl 
ester (Fura-2/AM) Ca’ imaging in response to ligand stimulation for 10s 
(arrows indicate onset). ¢, d, Inward current responses with various stimulus 
durations of PA (c) and 2-MP (d). e, f, Current responses to stimulus 
(bottom traces indicate onset) at various holding potentials (n = 14 and 4, 
respectively). g, h, Current—voltage (I-V) relationship; the respective peak 
current of each response in e and f is plotted. 
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extracellular Ca’* with EGTA or bis-(o0-aminophenoxy)ethane-N, 
N,N',N’-tetra-acetic acid (BAPTA) strongly diminished the increase 
in Ca’* mediated by Or47a + Or83b odour activation but had no 
effect on the increase in Ca** stimulated by activation of the Ga,- 
coupled o-adrenergic receptor («,-AR) by noradrenaline (Fig. 2a, b). 
This suggests that most of the increase in intracellular Ca”* observed 
when insect ORs are activated derives from the influx of extracellular 
Ca’. Drosophila olfactory sensory neurons show receptor-dependent 
spontaneous activity in the absence of odour ligands*”*”°', and we 
examined whether such spontaneous activity can be detected in het- 
erologous cells expressing insect ORs. In Or47a + Or83b-expressing 
cells, the resting intracellular Ca** level was higher than that in 
mock-transfected cells (Supplementary Fig. 2a) and this was strongly 
dependent on the extracellular Ca** concentration (Supplementary 
Fig. 2b, c). Basal intracellular Ca** levels were decreased by extracel- 
lular application of EGTA in Or47a + Or83b-expressing cells (Fig. 2a 
and Supplementary Fig. 2b), suggesting that insect ORs mediate spon- 
taneous Ca’ influx in the absence of odour ligand. 

To examine whether this odour-evoked activation requires known 
intracellular signalling cascades, we performed pharmacological 
experiments. No current response was observed in whole-cell patch- 
clamp experiments using a recording pipette filled with intracellular 
solution containing cyclic AMP, cyclic GMP or inositol-1,4,5- 
trisphosphate (IP;) (Supplementary Fig. 3a). U73122, an inhibitor 
of phospholipase C, abolished «,-AR-mediated Ca** responses but 
did not affect increases in Ca”* mediated by Or47a + Or83b (Fig. 2b, 
d). OR complex activation is therefore largely independent of Ga,/ 
PLC signalling (Fig. 2b, d). Moreover, insect OR activation fails to 
stimulate the Ga,/cAMP pathway, because we did not detect 
increases in intracellular cAMP in HEK293T cells expressing 
Or47a + Or83b, BmOr-1 + BmOr-2 or GPROR2 + GPROR7 after 
stimulation with odour (Fig. 2c and Supplementary Fig. 4a). 
Control cells expressing the mouse OR, mOR-EG, showed robust 
increases. cAMP produced by stimulation of mOR-EG activated 
cyclic nucleotide-gated channel A2 (CNGA2), but activation of 
Or47a + Or83b with pentyl acetate had no effect on this channel 
(Supplementary Fig. 4b, c). Application of 8-bromo-cGMP or 
8-bromo-cAMP failed to produce Ca** responses in HEK293T cells 
or oocytes expressing Or47a+ Or83b or GPROR2 + GPROR7 
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(Supplementary Fig. 5). Although we failed to detect activation of a 
Ga,/cAMP pathway by ligand stimulation of Or47a + Or83b, BmOr- 
1 + BmOr-2 or GPROR2 + GPROR7, we did observe that some OR 
complexes showed a small, ligand-independent sensitivity to cyclic 
nucleotides (Supplementary Fig. 5b). The basis of this sensitivity was 
not investigated further in this study. To examine whether G-protein 
signalling was necessary for insect OR activation, we performed 
experiments with GDP-fS, a non-hydrolysable form of GDP that 
inhibits G-protein-coupled signalling. GDP-BS had no effect on 
the current response of BmOr-1 + BmOr-2 to bombykol (Fig. 2e) 
or the response of Or47a + Or83b to pentyl acetate (Supplementary 
Fig. 3d) when these receptor complexes were expressed in oocytes, 
but GDP-BS significantly inhibited Ca**-activated Cl” current res- 
ponses induced by the activation by noradrenaline of o|-AR/Ga,/ 
PLC- and cystic fibrosis transmembrane conductance regulator 
(CFTR) currents elicited by stimulation of B-adrenergic receptor 
(B2-AR)/Ga,/adenylyl cyclase pathways by isoprenaline (isoprotere- 
nol) (Fig. 2e). These results make it unlikely that G-protein-signalling 
cascades underlie responses of the insect OR complex. 

To examine the response kinetics of an OR heteromultimer, we 
recorded increases in intracellular Ca** and electrical responses in 
the same cell. A HeLa cell expressing Or47a + Or83b and o,-AR 
showed a Ca** response to both pentyl acetate and noradrenaline, 
but showed an electrical response to pentyl acetate only, not to nor- 
adrenaline (Fig. 3a). The latency of the Or47a + Or83b-mediated 
Ca°* response was less than one-tenth that of the «;-AR/Ga,/PLC- 
mediated Ca*~ response (240 + 46 ms versus 3,300 + 400 ms). The 
latencies of current responses of Or47a + Or83b (Fig. 3b) and 
GPROR2 + GPROR7 (Fig. 3c) were 17.9 + 3.1 and 28.5 + 1.9 ms, 
respectively. This is considerably shorter than the about 50—200-ms 
latency of the G-protein-coupled olfactory response in vertebrate 
olfactory neurons’, in which the loading of G proteins is the 
rate-limiting step%. The time course of Ca** responses of 
Or47a + Or83b and GPROR2 + GPROR7 was also distinct from 
those of «,-AR (Fig. 3f) and mOR-EG (Fig. 3g). The slope of the 
odorant-induced insect OR response was not modulated by the dura- 
tion of stimulation (Fig. 3d, e), whereas the slope of noradrenaline- 
induced «-AR or eugenol-induced mOR-EG responses was sharper 
when the stimulation period was prolonged (Fig. 3f, g). 
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Figure 2 | Insect OR activity is independent of G protein signalling. a, Ca*~ 
responses of HeLa cells expressing Or47a + Or83b, o-AR or 

Or47a + Or83b + «-AR to a 10-s stimulation with 100 uM pentyl acetate 
(PA) or 100 nM noradrenaline (NA) with application of 10 mM EGTA (red 
bar). b, Effects of Ca?* chelators (10 mM EGTA and 5 mM BAPTA) and 
U73122 (PLC inhibitor; containing 0.1% DMSO) on Or47a + Or83b Ca?* 
response (DMSO control, n = 28; U73122, n = 18; EGTA, n = 10; BAPTA, 
n = 20). Significance assessed by t-test: asterisk, P< 0.001. ¢, cAMP 
production in HEK293T cells expressing mOR-EG, Or47a + Or83b, 


BmOr-1 + BmOr-2 or GPROR2 + GPROR7 stimulated with eugenol 
(EG; 1 mM), PA (100 4M), bombykol (BM; 10 11M) or 2-methyl phenol 
(2-MP; 100 |1M), respectively. d, As in a, but with application of 10 uM 
U73122 (filled bar) or 0.1% DMSO (open bar). e, Effect of GDP-BS on 
ligand-induced inward currents in Xenopus oocytes expressing BmOr- 
1 + BmOr-2 (30 1M BM), 4 -AR (1 1M NA) or B,-AR + CETR (10 uM 
isoprenaline). Significance assessed by t-test: asterisk, P< 0.05; two 
asterisks, P< 0.01; n = 5 each. Data are shown as means + s.e.m. 
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If the heteromultimer combination of the insect OR complex 
determines ion selectivity, we might expect that the ion permeability 
of odour-gated currents would differ with different subunit compo- 
sitions. Indeed, both the reversal potential and outward rectification 
of Or47a + Or83b and Or47a + BmOr-2 differed slightly (Supple- 
mentary Fig. 6a, b). Ruthenium red, a cation-channel blocker, which 
completely inhibits odour-evoked inward current responses of 
BmOr-1 + BmOr-2 in Xenopus oocytes’, inhibited both bomby- 
kol-evoked inward current and basal Ca** levels (Supplementary 
Fig. 6c, d) of BmOr-1 + BmOr-2-expressing cells but had no effect 
on HEK293T cells expressing GPROR2+GPROR7 (data not 
shown). This suggests that insect OR subunit composition affects 
the degree of inhibition by ruthenium red. 

To obtain evidence that currents are produced directly by the 
insect ORs, outside-out patch-clamp recordings were performed 
on oocyte membranes containing Or47a + Or83b or GPROR2 + 
GPROR7. First we established that oocytes expressing GPROR2 + 
GPROR7 showed high-affinity ligand-specific macroscopic currents 
that depended on the expression of both subunits (Supplementary 
Fig. 7a) and required extracellular Na* (Supplementary Fig. 7b, d). 
Patches from uninjected control oocytes did not show current 
responses to two different odour stimuli or oocyte Ringer’s solution 
(Supplementary Fig. 7e), indicating that odorants do not induce 
non-specific currents by interacting with oocyte membrane or 
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Figure 3 | Kinetic analysis of Ca2* and current responses of insect ORs in 
HeLa cells. a, Simultaneous recording of whole-cell current (red trace) and 
intracellular Ca** (Fluo-4-4K; blue trace) from a HeLa cell expressing 
Or47a + Or83b + 0 -AR. Stimulus durations of pentyl acetate (PA; 20 ms) 
and noradrenaline (NA; 500 ms) are indicated at the top. Ca** response 
latency is shown as mean = s.e.m. (m = 8). b, c, Onset of inward current 
responses of Or47a + Or83b to PA (b) and GPROR2 + GPROR7 to 
2-methyl phenol (2-MP) (c) applied with various stimulus durations as 
indicated by the different coloured curves. The latency of current response is 
shown as mean = s.e.m.; scales for b and care shown atthe right inc (b, n = 14; 
c,n = 4). d-g, Onset of Fluo-4-acetoxymethy] ester (Fluo-4/AM)-based Ca?* 
responses of Or47a + Or83b to PA (d), GPROR2 + GPROR7 to 2-MP (e), 
o;-AR to NA (f), and mOR-EG along with Go; to EG (g), with stimuli applied 
for various durations as indicated. Scales for d and e are shown at the right in 
e, and for f and g at the right in g. 
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membrane proteins. Outside-out patch preparations of oocytes 
injected with Or47a + Or83b or GPROR2 + GPROR7 showed tran- 
sient unitary currents whose frequency increased with the application 
of the cognate ligands pentyl acetate or 2-methyl phenol (Fig. 4 
and Supplementary Fig. 7f-h). The channel conductance of the 
Or47a + Or83b complex was 1.2 + 0.03 pA (n=4) at a holding 
potential of —80mV (Fig. 4d). The magnitude of odour-evoked 
currents was voltage-dependent and increased when the holding 
potential was progressively changed from +40mV to —120mV 
(Supplementary Fig. 7h). The frequency of these odour-evoked 
events was concentration-dependent (Fig. 4e). There was no differ- 
ence in pentyl-acetate-evoked conductance from membranes expres- 
sing Or47a + Or83b in the presence or absence of ATP (1 mM)/GTP 
(100uM) in the recording pipette, suggesting that ATP/GTP- 
mediated signal transduction components are not involved in the 
primary conductance of this channel (without ATP/GTP shown in 
Fig. 4a—c; with ATP/GTP shown in Supplementary Fig. 3b, c). 
Similar results were obtained in outside-out patch-clamp record- 
ing of HEK293T membranes expressing Or47a + Or83b or 
GPROR2 + GPROR7 (Fig. 5). In contrast with the bath application 
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Figure 4 | Excised outside-out patch-clamp recording of Or47a + Or83b 
currents measured in oocyte membranes. a-—c, Outside-out patch-clamp 
recording of a Xenopus oocyte membrane expressing Or47a + Or83b before 
stimulation (a), during stimulation with 300 [iM pentyl acetate (PA) (b), and 
after wash out (c). The bottom traces of each panel indicate expansions of 
300-ms current traces of single-channel recording at the positions indicated 
by the numbers. The data for a—c were obtained from the same cell with 
voltage clamped at —80 mV. Scales for a—c are indicated at the top in a and 
scales for the expansions are at the bottom in a. d, All-point current 
histograms of unitary events before (blue) and during (red) application of 
the ligand PA in b and c. Amplitude distributions were fitted with two 
gaussian components (black lines). e, Dose-response and ligand selectivity 
of Or47a + Or83b currents with voltage clamped at —90 mV. 
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Figure 5 | Excised outside-out patch-clamp recording of insect OR currents 
measured in HEK293T membranes. a—d, Or47a + Or83b-expressing 
HEK293T membranes stimulated for 3 s with pentyl acetate (PA; 300 uM) 
(a, b) or the non-agonist eugenol (EG; 300 |iM) (¢, d). The bottom traces show 
expansions of the current trace of the single-channel recording at the 
positions indicated by coloured boxes on trace 2 before (blue), during (red) 
and after (green) stimulation with PA (a) or EG (c). All-point current 
histograms were obtained from the region indicated on trace 1 by the blue bar 
without stimulation (blue histogram, left) and by the red bar for stimulation 
with PA (b) or EG (d) (red histogram, right). Mean peak current level (1.6 pA) 
was obtained from the fitted gaussians. Data in a—d were obtained from the 
same cell. Scales for a and ¢ are indicated at the top in ¢ and scales for the 
expansions are at the bottom in c. The scale on the y axis indicates the relative 
number of current events with a bin width of 0.05 pA. e-h, As in a—d, but 
using GPROR2 + GPROR7-expressing HEK293T membranes stimulated for 
3s with 2-methyl phenol (2-MP; 300 11M) (e, f) or the non-agonist EG 

(300 11M) (g, h) Scales for e and g are indicated at the top in g and scales for the 
expansions are at the bottom in g. Data in e-h were obtained from the same 
cell. i, Outside-out patch-clamp recording of vector-transfected HEK293T 
membranes stimulated for 3 s with 300 LM PA, EG and 2-MP. Membranes in 
a-i were at a holding potential of —60 mV. j, Schematic model for ligand- 
gated channel properties of the OR + Or83b complex. 
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used for oocyte experiments in Fig. 4, we used a puff application of 
odour ligands. Pentyl acetate evoked unitary currents of 1.5 + 
0.06pA (n=4) in outside-out patch membranes expressing 
Or47a + Or83b and held at —60 mV (Fig. 5a, b, and Supplemen- 
tary Fig. 8a—c). We calculated a slope conductance of 27 pS for 
Or47a + Or83b from a current-voltage (L-V) curve (Supplemen- 
tary Fig. 8b). Stimulation of Or47a + Or83b-expressing cell mem- 
brane with the non-ligand eugenol did not increase currents (Fig. 5c, 
d). The same experiments performed in membranes expressing 
GPROR2 + GPROR7 produced unitary currents of 1.2 + 0.07 pA 
(n= 3) with a conductance of 20 pS at —60 mV in response to the 
cognate ligand 2-methyl phenol (Fig. 5e, f, and Supplementary Fig. 
8d, e). As seen for Or47a + Or83b, the non-ligand eugenol did not 
stimulate GPROR2 + GPROR7 currents (Fig. 5g, h). The presence 
of currents in the absence of cognate ligand suggests that both 
Or47a + Or83b and GPROR2 + GPROR7 are spontaneously active, 
which is consistent with our other results in Figs 2a, 4a and 
Supplementary Figs 3 and 6d. Untransfected HEK293T membranes 
did not show currents on stimulation with pentyl acetate, eugenol or 
2-methyl phenol (Fig. 5i). In both oocytes (Fig. 4) and mammalian 
cells (Fig. 5), single-channel events predominated. We attribute this 
either to low expression levels of these insect ORs in heterologous cell 
membrane patches or to unstable characteristics of the multimeric 
receptor in these cells. Nevertheless, these outside-out patch-clamp 
experiments using two different membrane preparations provide 
initial evidence that the insect OR complex possesses ligand-gated 
channel properties. 

Taken together, these data provide compelling evidence that a 

heteromeric complex of a conventional insect OR and the highly 
conserved Or83b family co-receptor has the characteristics of a 
cation non-selective ion channel directly gated by odour or phero- 
mone ligands (Fig. 5j). We conclude that G-protein-mediated sig- 
nalling is negligible in producing the current elicited by odour 
activation of insect OR heteromultimers. Our findings provide 
insight into long-argued insect olfactory transduction mechanisms 
and may explain the lack of clear consensus on the role of second 
messengers in this process**. The insect ORs share no homology with 
any previously described ion channel and do not contain any known 
ion selectivity filter motifs**. We observed that insect OR activity was 
not inhibited by Gd** (data not shown), a lanthanide that is a broad- 
spectrum ion channel inhibitor**”*. Therefore, although the ionic 
permeability reported here for Na*, K* and Ca** would be consis- 
tent with the properties of non-selective cation channels, a molecular 
basis for this novel ionotropic activity remains to be elucidated. The 
spontaneous activity of the OR complex found here seems to account 
for previous observations that olfactory sensory neurons exhibit 
bipolar electrical activity and become electrically negative on the 
deletion of Or83b in vivo*’*. Given that there are 62 and 79 potential 
ligand-binding OR subunits in Drosophila and Anopheles, respec- 
tively, the insect ORs may represent the largest single family of 
ion-channel-like proteins in any organism. Our work also raises 
the intriguing possibility that the insect gustatory system, which 
senses bitter and sweet tastants as well as carbon dioxide, shares this 
ionotropic coupling mechanism with the insect ORs. In fact, an 
ionotropic sugar-gated channel in fleshfly taste cells has previously 
been reported’. Our finding offers the caveat that other orphan 
receptors classified as G-protein-coupled receptors merely because 
of their putative seven-transmembrane topology may instead possess 
ligand-gated channel activities, as has been shown previously for 
light-activated channelrhodopsin”. This work has important impli- 
cations for worldwide efforts to identify specific inhibitors for the 
insect ORs, which may prove useful in controlling host-seeking beha- 
viours of disease-vector insects such as mosquitoes. 
Note added in proof: The authors of an accompanying paper’’ also 
conclude that insect ORs possess ion channel-like properties, but 
they report G,-dependent cyclic-nucleotide activation of Or83b 
alone, which we fail to observe. 
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METHODS SUMMARY 

Odorant stocks and stimulation. Stock solutions of bombykol, isoprenaline, 
forskolin, 3-isobutyl-1-methylxanthine, ruthenium red, and U73122 were pre- 
pared in dimethylsulphoxide (DMSO). Other stimuli in Figs 1-3 and 5 were 
added directly to Ringer’s solution. Stimuli in Fig. 4 were prepared in DMSO as 
1M stocks, which were then diluted into the extracellular solution. For patch- 
clamp recording and kinetic analysis, odorants were applied focally to the 
recording cell with the use of a handmade pressure-ejection system and a TTL 
(transistor—transistor logic) pulse generator. For other experiments, odorants 
and inhibitors were applied by perfusing the bath solution. 

Electrophysiology and Ca”* imaging. Two-electrode voltage-clamp recording 
from Xenopus laevis oocytes was performed as described previously’. Ca?* 
imaging from mammalian cell lines was performed as described previously”. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Patch-clamp experiments in mammalian cell lines. Full-length complemen- 
tary DNAs for silkmoth ORs (BmOr-1 and BmOr-2), mosquito ORs (GPRORI, 
GPROR2 and GPROR7) and fruitfly ORs (Or47a and Or83b) were cloned into 
the pME18S vector. ORs were transiently transfected into HeLa or HEK293T 
cells with Lipofectamine 2000 reagent (Invitrogen). «;-AR and GFP or mRFP 
were co-transfected as a control. Whole-cell currents were amplified with a 
patch-clamp amplifier (CEZ-2400; Nihon-koden), and were digitized with 
PowerLab (AD Instruments). The extracellular solutions (normal Ringer’s solu- 
tion) contained (in mM): 140 NaCl, 5.6 KCI, 5 HEPES, 2.0 pyruvic acid sodium 
salt, 1.25 KH,POg, 2.0 CaCh, 2.0 MgCh, 9.4 D-glucose (pH 7.4). The electrode 
solution contained (in mM): 140 KCl, 10 HEPES, 5 EGTA-2K, 10 p-glucose 
(pH 7.4). To record the shift in reversal potential and in equilibrium potential 
for a specific ion, the following external (bath) and internal (electrode) solution 
was used: low-Na* (N-methyl-p-glutamine; NMDG) external solution (in mM: 
190 NMDG, 40 HEPES, 5.6 KCl, 2.0 pyruvic acid sodium salt, 1.25 KHPOx, 2.0 
CaCl,, 2.0 MgCl, 9.4 D-glucose; pH 7.4); low-K* (NMDG) internal solution 
(in mM: 202 NMDG, 40 HEPES, 5 EGTA-2K, 10 b-glucose; pH 7.4). For the 
outside-out recording, normal Ringer’s solution was used for the electrode solu- 
tion. The data were sampled at 20 kHz and filtered at 2 kHz. 

Xenopus oocyte electrophysiology. Oocytes were microinjected with 25 ng of 
complementary RNA (cRNA) for a conventional OR and 25 ng of cRNA for the 
Or83b family. Whole-cell currents were recorded with a two-electrode voltage- 
clamp filled with 3 M KCl, and were amplified with an OC-725C amplifier 
(Warner Instruments), low-pass filtered at 50Hz and digitized at 1 kHz. 
Outside-out patch-clamp recordings were performed 18—26h after injection. 
After removal of the vitelline layer, oocytes were transferred to a Petri dish with 
a bath solution of oocytes Ringer’s solution containing (in mM): 82.5 NaCl, 2 
KCl, 1 MgCl, 5 HEPES, 1.8 CaCl, (pH 7.5). Pipettes (4-7 MQ) were covered 
with Sylgard (Dow Chemical Company) and filled with intracellular solution 
containing (in mM): 100 KOH, 10 HEPES, 1 EGTA, 100 sulphamic acid 
(pH7.6). After seal formation (more than 5GQ), patches were excised and 
transferred to the recording chamber, where the extracellular side was continu- 
ously superfused with extracellular solution containing (in mM): 100 NaOH, 10 
HEPES, 1 MgCl,, 100 sulphamic acid (pH 7.5); this was supplemented, where 
indicated, by the odorants. Solutions were switched by computer-driven electric 
valves (General Valve Corp.). Currents were recorded with an Axopatch 200A 
amplifier (Axon Instruments, Inc.), low-passed at 1 kHz (eight-pole Bessel; 
Frequency Devices), digitized at 10kHz by means of an ITC-16 interface 
(Instrutech Corporation) and saved to a PC hard disk with PULSE v8.11 acquisi- 
tion software (HEKA Elektronic). Data were analysed with Clampfit 9.0 (Axon 
Instruments, Inc.) and Origin PRO 7 (Origin Lab). The LV curves showing ion 
permeability were produced with low-Na* solution (Na* in oocyte Ringer’s 
solution replaced by the impermeable cation NMDG") and Cl -free solution 
(Cl in oocyte Ringer’s solution replaced by sulphamic acid). These experiments 
used bath application of ligands, precluding any measurement of the response 
latency of these currents. 

Ca’* imaging. Insect ORs were transfected into HeLa or HEK293T cells, which 
were loaded with 2.5 ,1M Fura-2/AM or Fluo-4/AM for 30 min. Fluorescence was 
measured with an Aquacosmos Ca”* imaging system (Hamamatsu Photonics). 
For the simultaneous recording of whole-cell currents and intracellular Ca”* 
levels, the electrode solution (in mM: 140 KCl, 10 HEPES, 5 EGTA-2K, 10 
p-glucose, 0.05 Fluo-4-pentapotassium salt (Fluo-4-4K); pH 7.4) was used. To 
synchronize the patch-clamp amplifier, odorant stimulator and charge-coupled 
device camera (ORCA-ER; Hamamatsu), all devices were under the control ofa 
peripheral interface controller (PICI6F877; Microchip Technology, Inc.). 
cAMP assay. HEK293T cells transfected with mOR-EG, Or47a + Or83b, 
BmOr-1 + BmOr-2 or GPROR2 + GPROR7 were incubated with 1mM 
3-isobutyl-1-methylxanthine for 30 min. The cells were then exposed to the 
indicated concentration of odorant solution for 15 min. cAMP levels were deter- 
mined with an enzyme-linked immunosorbent assay kit (Applied Biosystems) in 
accordance with the manufacturer’s directions. 
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Drosophila odorant receptors are both ligand-gated 
and cyclic-nucleotide-activated cation channels 


Dieter Wicher', Ronny Schafer’, René Bauernfeind’, Marcus C. Stensmyr’, Regine Heller’, Stefan H. Heinemann® 


& Bill S. Hansson! 


From worm to man, many odorant signals are perceived by 
the binding of volatile ligands to odorant receptors' that belong 
to the G-protein-coupled receptor (GPCR) family’. They couple to 
heterotrimeric G-proteins, most of which induce cAMP produc- 
tion’. This second messenger then activates cyclic-nucleotide- 
gated ion channels to depolarize the olfactory receptor neuron, 
thus providing a signal for further neuronal processing. Recent 
findings, however, have challenged this concept of odorant signal 
transduction in insects, because their odorant receptors, which 
lack any sequence similarity to other GPCRs‘, are composed of 
conventional odorant receptors (for example, Or22a), dimerized 
with a ubiquitously expressed chaperone protein’, such as Or83b 
in Drosophila’. Or83b has a structure akin to GPCRs, but has 
an inverted orientation in the plasma membrane*’. However, G 
proteins are expressed in insect olfactory receptor neurons®, and 
olfactory perception is modified by mutations affecting the cAMP 
transduction pathway’. Here we show that application of odorants 
to mammalian cells co-expressing Or22a and Or83b results in 
non-selective cation currents activated by means of an ionotropic 
and a metabotropic pathway, and a subsequent increase in the 
intracellular Ca** concentration. Expression of Or83b alone leads 
to functional ion channels not directly responding to odorants, 
but being directly activated by intracellular cAMP or cGMP. Insect 
odorant receptors thus form ligand-gated channels as well as com- 
plexes of odorant-sensing units and cyclic-nucleotide-activated 
non-selective cation channels. Thereby, they provide rapid and 
transient as well as sensitive and prolonged odorant signalling. 

To investigate the mechanism of odorant receptor activation, we 
performed electrophysiological and fluorescence optical experiments 
on heterologously expressed odorant receptors'®"! of the fruitfly 
(Drosophila melanogaster). Co-expression of Or22a and Or83b in 
human embryonic kidney (HEK293) cells (Supplementary Fig. 1) 
and stimulation with the odorant ethyl butyrate’ led to an increased 
intracellular Ca?* concentration ({Ca?*];), as previously reported’. 
The rise in the [Ca?*]; level was dependent on the ethyl butyrate 
concentration and relied on the presence of extracellular Ca*", sug- 
gesting a Ca’* influx rather than a Ca*~ release from intracellular 
stores (Supplementary Fig. 2). 

Under voltage-clamp control at —60 mV in the whole-cell patch- 
clamp configuration, an inward current developed on short ethyl 
butyrate application (Fig. 1a), whereas no current was observed when 
HEK293 cells were transfected with either Or22a or Or83b alone 
(n= 6). The odour-induced current is carried by cations, because 
it remained unchanged in Cl -free solutions (1 = 5, not shown), and 
is composed of an ionotropic (J;) and a metabotropic (J,,) compon- 
ent. J;, which does not rely on the presence of internal ATP and GTP 


(Fig. la, bottom), activates rapidly and shows a fast decay (peak at 1 s, 
termination at 10s; see Supplementary Table 1). [,,, which relies on 
internal ATP and GTP (Fig. la, top), develops and decays consid- 
erably slower than J; (latency of 10s, peak at 60 s, termination at 80 s; 
Supplementary Table 1), but is considerably larger than [, (Fig. 1c, 
top). Experiments with outside-out patches confirmed the whole-cell 
recording results (Fig. 1b, d and Supplementary Table 2). The nature 
of the I; component is at present unclear, but could result from a 
conformational change in the odorant receptor complex in response 
to odorant binding. 

Further experiments were focused on the metabotropic current 
response. The ethyl-butyrate-induced currents showed no marked 
voltage dependence, except for a weak outward rectification (Fig. le, 
f). The instantaneous current responses following steps in the mem- 
brane potential suggest that the odorant-induced channels are con- 
stitutively active. No current was detected using symmetrical Cs* 
solutions (140 mM in pipette and bath solution; data not shown, 
n= 6). The current size depended on odorant concentration, 
saturating at about 100nM ethyl butyrate (Fig. 1f, top), whereas 
the half-maximal effect was obtained at around 700 pM (Fig. 2e). 
Part of the receptors seem to be active even in the absence of ethyl 
butyrate, because the Or22a-inhibitor benzaldehyde’? reduced the 
current recorded under control conditions (Fig. 1f, bottom, n= 6). 

To test which G proteins are involved in metabotropic odorant 
signalling, we co-expressed Or22a with ion channels reporting 
G-protein activation'*. Odorant receptors either couple to adenylyl 
cyclase stimulating G, proteins, thereby leading to increased cAMP 
levels, or to phospholipase C activating G, proteins, leading to 
inositol-1,4,5-trisphosphate formation’’. To test putative G, protein 
coupling, we co-expressed human KCNQ4 (potassium voltage-gated 
channel, KQT-like subfamily, member 4) channels, which are inhi- 
bited on activation of endogenous G, proteins". Application of ethyl 
butyrate at various concentrations had no effect on the human 
KCNQ4 current, although ethyl butyrate produced olfactory recep- 
tor currents (Supplementary Fig. 3). This indicates that odorant 
receptor activation by ethyl butyrate does not lead to Gg protein 
activation. To test putative G, protein coupling, we first co-expressed 
human HCN2 (hyperpolarization-activated cyclic-nucleotide-gated 
potassium channel 2) channels, which report changes in cyclic 
nucleotide concentrations’®. Increasing intracellular cAMP concen- 
tration due to G, protein activation causes a depolarizing shift of 
the steady-state activation curve and an acceleration of activation 
kinetics'®. These modifications of HCN2 current characteristics were 
observed after ethyl butyrate stimulation when these channels were 
co-expressed with Or22a plus Or83b, with Or22a alone, but not with 
Or83b alone (Supplementary Fig. 4). Second, we co-expressed the 
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cAMP-sensitive CNGA2 type of human cyclic-nucleotide-gated 
(CNG) channels’. In cells expressing Or22a plus Or83b or expres- 
sing Or22a alone, ethyl butyrate caused a concentration-dependent 
rise in CNGA2 current (Fig. 2a—c). Cells solely expressing CNGA2 did 
not respond to ethyl butyrate, but did respond to the adenylyl cyclase 
activator forskolin (Fig. 2b). Because CNG channels are also per- 
meable to Ca** (ref. 18), we visualized the stimulation of CNGA2 
channels by Ca** imaging. Ethyl butyrate application produced an 
increase in [Ca”*]; that was most prominent near the plasma mem- 
brane (Fig. 2d). These data suggest that odorant stimulation of the 
‘true’ olfactory receptor Or22a increases the intracellular cAMP con- 
centration by means of a G, pathway. Finally, a radioreceptor assay 
revealed a significantly increased cAMP level in cells expressing 
Or22a and Or83b on ethyl butyrate stimulation (from 1.28 + 
0.06 pmol permg protein (control) to 1.65 + 0.10 pmol per mg 
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Figure 1| Co-expressed Or22a and Or83b mediate ethyl-butyrate- 
stimulated ion currents. a, c, Recordings (a) and maxima (c) of currents 
activated by ethyl butyrate (Etb; 100 nM, applied for 1 s, red bar) under 
whole-cell voltage-clamp at -60 mV with 2 mM internal ATP and GTP (top) 
or ADP and GDP (bottom). Etb elicits a rapidly developing, transient inward 
current (J;), which overlaps with a slowly activating and slowly decaying 
current that requires the presence of ATP and GTP (Im). 1 = 18; *P < 0.05; 
***P < 0.001; Student’s f-test (c). b, d, Recordings (b) and maxima (d) of 
currents activated as in a from outside-out patches. Without ATP and GTP, 
there is only the initial response of I; (bottom), whereas, in the presence of 
ATP and GTP, both J; and I, are activated (top). n = 15; *P < 0.05; 
Student’s t-test (d). e, Current responses to the indicated pulse protocol 
obtained in the absence (Control) and in the presence (control currents 
subtracted; Etb — control) of 100 1M Etb for 1 min. f, Responses to ramp 
voltage protocols (+100 mV in 400 ms) for Etb-induced currents (at 
indicated Etb concentrations, control currents subtracted, top) and for 
currents recorded before (Control) and after application of 10 uM 
benzaldehyde (Bea) or 100 nM Etb (bottom). Error bars represent s.e.m. 
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protein (100nM ethyl butyrate); »=3 experiments; P< 0.05; 
Student’s t-test). The ethyl-butyrate -concentration dependence of 
cAMP production reported by CNGA2 currents (Fig. 2c) was similar 
to that of receptor current stimulation (Figs 1f and 2e). 

Because the G-protein-mediated signals are considerably slower 
than those arising from direct receptor activation (Fig. 1a), we asked 
whether the involvement of G proteins enhances the odour sensitivity 
of the signalling system. We therefore tested whether the non- 
hydrolysable inhibitor of G proteins guanosine 5’-(B-thio)di- 
phosphate (GDP-B-S), intracellularly applied by means of the patch 
pipette, may reduce the ethyl butyrate sensitivity. Compared with 
the ethyl butyrate response under control conditions (effector 
concentration for half-maximum response, ECs) = 677 pM; Hill 
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Figure 2 | Odour signal transduction by means of G, protein signalling. 

a, Whole-cell current responses in a cell co-expressing the cyclic-nucleotide- 
gated ion channel human CNGA2 (hCNGA2) with Or22a before and after 
application of 1 nM and 100 nM Etb. b, Relative increase in CNGA2 current 
at -80 mV induced by 100 nM Etb when hCNGA2 was expressed alone 
(Control) or co-expressed with the indicated odorant receptors. The orange 
bar indicates the stimulating effect of 10 1M forskolin in cells expressing 
hCNGA2 alone. n = 5; *P < 0.05; **P < 0.01; Student’s t-test. 

c, Concentration—response curve for Etb-induced currents in cells 
expressing hCNGA2 and Or22a, measured at —80 mV and normalized to 
controls (n = 5). The continuous curve is a Hill fit (EC59 = 3 nM, Hill 
coefficient = 0.54). d, Free [Ca?*]; (in nM) in a HEK293 cell expressing 
hCNGA2 and Or22a before (Control) and 5 min after application of Etb or 
10 uM forskolin. Scale bar, 10 tm. e, Concentration—response curves for 
Etb-induced currents under control conditions and with 500 uM GDP-B-S in 
the pipette to disrupt G protein activation (n = 9, Control; n = 11, GDP-B-S; 
***P < 0.001; ANOVA). The continuous curves are Hill fits described by 
ECs values of 677 pM and 70 nM and Hill coefficients of 0.61 and 1.1 for the 
control and GDP-f-S, respectively. f, Current families according to the pulse 
protocol of Fig. le before (Control) and after bath application of 8-bromo- 
cAMP (8-br-cAMP) in a cell expressing Or83b and Or22a. Error bars 
represent s.e.m. 
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coefficient of 0.61), the presence of GDP-f-S caused a considerable 
shift in the ethyl-butyrate-concentration dependence (ECs = 
70 nM; Hill coefficient of 1.1; Fig. 2e). This indicates that the iono- 
tropic pathway of receptor activation requires significantly higher 
odour concentrations than the metabotropic pathway. 

The cAMP signal originating from Or22a stimulation may subse- 
quently initiate the activation of an ion channel complex requiring 
Or83b. If this were true, an artificial increase of the cAMP level in 
HEK293 cells expressing Or22a and Or83b should mimic the effect 
of ethyl butyrate. Indeed, bath application of the membrane- 
permeable cAMP analogue 8-bromo-cAMP induced a membrane 
current similar to that produced by ethyl butyrate (Fig. 2f). More- 
over, 8-bromo-cAMP induced a similar current in cells expressing 
Or83b alone (Fig. 3a). The 8-bromo-cAMP failed, however, to 
elicit a current response in cells solely expressing Or22a (Fig. 3a). 
Stimulation of cAMP production by forskolin yielded large current 
responses of twice the size as those obtained with 8-bromo-cAMP, 
supporting the notion that cAMP serves as an activating ligand 
for Or83b (Fig. 3a). To illustrate that the current activated by 
8-bromo-cAMP is identical to the current elicited by ethyl butyrate, 
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Figure 3 | Cyclic nucleotides activate odorant receptors. a, Maximal 
inward currents at—100 mV activated by 5 UM 8-bromo-cAMP (8-br-cAMP) 
or 50 uM forskolin in cells expressing odorant receptors as indicated (n = 7; 
***P < 0.001; Student’s t-test). b, Scheme of endogeneous cAMP 
production by AKH I-induced stimulation of the G,-protein-activating P. 
americana adipokinetic hormone receptor AKHR (top). Current responses 
to ramp voltage protocols (+100 mV in 400 ms) recorded from cells 
expressing P. americana AKHR either alone or together with Or83b on 
application of 10 nM AKH I (difference curves: 5 min AKH I— before AKH I, 
bottom). The bath solution contained the PLC inhibitor U73122 (5 uM) to 
depress G,-protein-related signalling. c, Current families recorded in a cell 
expressing Or83b before (Control) and after stimulation by 8-bromo-cGMP. 
d, Concentration—response curves for current activation by 8-bromo-cAMP 
and 8-bromo-cGMP in cells expressing Or83b (— 100 mV; *P < 0.05; 
ANOVA test). e, Free [Ca**]; (in nM) in a cell expressing Or83b under 
control conditions and 3 min after application of 8-bromo-cAMP and 
8-bromo-cGMP (after 5 min of washing). The histogram shows [Ca?*]; ina 
region at the lower pole of the cell (red area). Error bars represent s.e.m. 
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cells expressing Or22a and Or83b were first stimulated with 
8-bromo-cAMP at a high concentration of 100 uM and subsequently 
treated with 100 uM ethyl butyrate. Ethyl butyrate failed to cause a 
further increase in the current under these conditions (+ 1.6 + 3.8% 
of current induced by 8-bromo-cAMP). To test further the notion 
that increased intracellular cAMP levels lead to activation of Or83b, 
we next co-expressed Or83b with the Periplaneta americana adipo- 
kinetic hormone receptor (AKHR), which is known to activate G, 
proteins. Stimulation of P. americana AKHR with the adipokinetic 
hormone I (AKH]I) induced a current (294 + 64pA at —100 mV, 
n= 10), but failed to do so when only AKHR was expressed 
(12 = 24pA, n= 6; Fig. 3b). 

The results reported so far for the metabotropic odour response 
indicate that the Drosophila olfactory receptor proteins Or22a and 
Or83b form a complex serving as both a GPCR and an ion channel. 
To clarify the role of cAMP in channel activation, we tested whether 
the activity of a cAMP-dependent protein kinase (protein kinase A) 
was necessary. Ion currents elicited by 8-bromo-cAMP stimulation 
of Or83b-expressing cells were not significantly altered by preincu- 
bation with the protein kinase A inhibitors KT5720 (10 uM) or 
PKI1422 (1 1M; n = 6; data not shown). Thus, channel phosphoryla- 
tion by protein kinase A can be excluded as an activation mechanism. 

Ion channels activated or modulated by cAMP, such as CNG or HCN 
channels, are also sensitive to cCGMP"*. In Or83b-expressing HEK293 
cells, 8-bromo-cGMP elicited a current similar to that obtained on 
8-bromo-cAMP application (Fig. 3c). The concentration—response 
relationships for both cyclic nucleotides show that 8-bromo-cGMP is 
a significantly better agonist than 8-bromo-cAMP for current activation 
(Fig. 3d) and for the elevation of [Ca**]; (Fig. 3e). 

The ion selectivity of the channels formed by Or83b was assayed by 
changing the composition of the bath solutions. By measuring the 
reversal potentials (V;ey) in Ca?*-free and Na‘ -free bath solutions, 
permeability ratios were determined according to equations (1) and 
(2), respectively (see Methods Summary): Pya:Px = 1.1 £ 0.1 from 
Vey of +4.0+ 1.1mV (Ca**-free, n=7), and PoyiPk =5.440.2 
from V,., of +1.1+0.4mV (Na‘-free, n=7). Or83b therefore 
forms a non-selective cation channel characterized by the permea- 
bility sequence Poa: PNa:Pe = 5.4:1.1:1. 

To test whether cyclic nucleotides also activate olfactory receptors 
containing odorant receptor proteins different from Or22a, Or83b 
was co-expressed with Or47a. Cells responding to the Or47a agonist 
pentyl acetate’? were also responsive to 8-bromo-cAMP and 8-bromo- 
cGMP, indicating that odorant receptor complex activation by cyclic 
nucleotides is not a phenomenon restricted to Or22a (Supplementary 
Fig. 5). 

Comparing the results for activation and ion selectivity of Or83b 
channels with those of CNG and HCN channels (both non-selective 
cation channels that pass monovalent cations) reveals similarities but 
also differences. The activation of Or83b is similar to that of CNG 
channels. Furthermore, Ca’* ions permeate Or83b and CNG 
channels” much more readily than they permeate HCN channels”. 
However, although Cs” easily permeates CNG channels, it blocks 
Or83b moderately and HCN potently”. 

In green algae, two GPCRs form light-gated ion channels: the 
proton-conducting channelrhodopsin-1 (ref. 21) and the non- 
selective-cation-conducting channelrhodopsin-2 (ref. 22). Or83b 
shows only a low degree of sequence identity with these proteins 
(13.0% amino acid identity with channelrhodopsin-2). Sequence 
comparison of Or83b with Drosophila ion channels, including 
CNG and HCN, also reveals a low degree of similarity. However, 
the motif TVVGYLG (393-399), located near the cytoplasmic border 
of the sixth predicted transmembrane helix of Or83b, is reminiscent 
of the selectivity filter motif TVGYG in the pores of K* channels”. 
Considering this motif as part of a putative selectivity filter, its modi- 
fication might affect the ion permeability. Deletion of two residues in 
this motif, as indicated in Fig. 4a (“GYG mutation’), altered the slight 
outward rectification of wild-type Or83b currents (Figs 1f, 2f and 3b) 
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into a moderate inward rectification (Fig. 4a). Using both Ca**-free 
and Na‘-free bath solutions, the ratio of currents measured at 
+100 mV and —100 mV was significantly reduced in GYG-expressing 
cells, indicating an attenuated K* conductance (Fig. 4b). Additionally, 
the GYG mutation caused a shift in the reversal potentials. V-ey was 
+24.8 + 2.4mV for Ca?*-free (n= 5) and +16.5 + 5.1 mV for Na’ - 
free (n= 7) bath solutions (Fig. 4c). The permeability ratios deter- 
mined according to equations (1) and (2) were Pya:Px = 2.5 + 0.3 
and Pca:Px = 19.6 = 8.9, indicating a reduction in K* permeability 
with respect to the wild type. These experiments demonstrate that 
Or83b forms ion channels in HEK293 cells and that the TVVGYLG 
motif is involved in controlling the ion permeability of the channel 
(Fig. 4d). 

The pair of Drosophila olfactory receptor proteins Or22a—Or83b 
represents a typical insect odorant receptor. Homologues of Or83b 
have been identified in various other insect species. The design 
principle of insect odorant receptors being composed of both an 
odorant-sensitive protein and a protein finally transducing the 
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Figure 4 | Modification of a putative Or83b pore region alters ion 
permeability. a, The wild type (WT) Or83b amino acids Val 394 and 

Leu 398 were deleted in a mutant termed ‘GYG? (left panel). Family of 
currents activated by 8-bromo-cAMP in a cell expressing the Or83b GYG 
mutant (right panel; recording protocol as in Fig. 1e; ‘0’ indicates the 
baseline). b, Degree of current rectification of 8-bromo-cAMP-activated 

(5 tM) wild-type and mutant channels, determined as the absolute value of 
currents obtained at +100 mV relative to those at -100 mV. The bath/ 
pipette solutions contained in mM: 140 Na*/140 K* (Na‘/K~) and 

10 Ca**/140 K* (Ca?*/K*). n = 7; *P < 0.05, **P < 0.01; Student’s t-test. 
c, Superimposed current responses to ramp voltage protocols (+100 mV in 
400 ms) on stimulation of HEK293 cells expressing wild type or GYG mutant 
with 5 uM 8-bromo-cAMP under conditions as in b. d, Scheme of odorant 
receptor and channel function. When stimulated by an odorant such as Etb, 
Or22a activates the channel protein Or83b by an ionotropic and a 
metabotropic pathway. The direct activation of Or83b by Or22a (light blue 
arrow) leads to a fast and transient ion conductance. The metabotropic 
pathway includes activation of G, proteins by Or22a. The Ga, subunit 
stimulates adenylyl cyclase (AC), thereby increasing cAMP production. 
Cyclic AMP slowly activates a long-lasting non-selective cation conductance. 
Opening of the Or38b channel that harbours the TVVGYLG motif leads to 
Na* and Ca” influx and membrane depolarization. 
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chemical message into an electrical signal ensures very rapid recog- 
nition of high odour concentrations by means of the ionotropic 
pathway as well as a somewhat slower but prolonged and highly 
sensitive odour detection via the G-protein-mediated signal amp- 
lification (Fig. 4d). 

Note added in proof: A study by Sato et al.** shows that heteromeric 
insect olfactory receptor complexes from different species form 
ligand-gated ion channels when expressed in various heterologous 
systems. 


METHODS SUMMARY 

Cell culture and transfection. HEK293 cells were cultured as described”. Cells 
were transfected with 1 tg pcDNA3-dOr22a-GFP (green fluorescent protein) 
and/or 1 wg pcDNA3-dOr83b-GFP using Roti-Fect (Roth). The DNAs were a 
gift from E. Neuhaus and H. Hatt’. 

Electrophysiology. Ion currents in HEK293 cells were measured with the patch- 
clamp technology as described”’. The permeability ratios were calculated accord- 
ing to 


PyatPx = [K*]/[Na*], X exp(VyevF/RT) (1) 
and 
PoaiPx = [K*];/4[Ca?*]o X exp( Veev2F/RT) (2) 


where P;,,, are the permeabilities, [ion]; and [ion], are the internal and 
external activity of the respective ion, V,., is the reversal potential”, F is the 
Faraday constant, R is the gas constant, and T is the absolute temperature. 
To rule out an effect on endogenous HEK293 channels”, ethyl butyrate, 
8-bromo-cAMP and 8-bromo-cGMP were first tested on non-transfected cells 
(n= 5-7). 

Calcium imaging. Free intracellular Ca?* concentration ({Ca?*];) was deter- 
mined by Fura-2 fluorescence as described’*. 

Radioreceptor assay. The cAMP content of cells was determined as described in 
Methods. 

Data presentation and statistics. Results are given as means + standard error of 
mean (s.e.m.), where m represents the number of cells. The evaluation of statis- 
tical significance of differences was performed with Student’s test for testing 
one variable and two-way ANOVA for testing two variables. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 23 July 2007; accepted 18 February 2008. 
Published online 13 April 2008. 


1. Buck, L. & Axel, R. A novel multigene family may encode odorant receptors: a 
molecular basis for odor recognition. Cell 65, 175-187 (1991). 

2. Mombaerts, P. Seven-transmembrane proteins as odorant and chemosensory 
receptors. Science 286, 707-711 (1999). 

3. Ronnett, G. V.& Moon, C. G proteins and olfactory signal transduction. Annu. Rev. 
Physiol. 64, 189-222 (2002). 

4. Benton, R., Sachse, S., Michnick, S. W. & Vosshall, L. B. Atypical membrane 
topology and heteromeric function of Drosophila odorant receptors in vivo. PLoS 
Biol. 4, e20 (2006). 

5. Neuhaus, E. M. et al. Odorant receptor heterodimerization in the olfactory system 
of Drosophila melanogaster. Nature Neurosci. 8, 15-17 (2005). 

6. Larsson, M. C. et al. Or83b encodes a broadly expressed odorant receptor 
essential for Drosophila olfaction. Neuron 43, 703-714 (2004). 

7. Wistrand, M., Kall, L. & Sonnhammer, E. L. A general model of G protein-coupled 
receptor sequences and its application to detect remote homologs. Protein Sci. 15, 
509-521 (2006). 

8. Riitzler, M., Lu, T. & Zwiebel, L. J. Ga encoding gene family of the malaria 
vector mosquito Anopheles gambiae: expression analysis and 
immunolocalization of AG, and AG, in female antennae. J. Comp. Neurol. 499, 
533-545 (2006). 

9. Martin, F., Charro, M. J. & Alcorta, E. Mutations affecting the cAMP 
transduction pathway modify olfaction in Drosophila. J. Comp. Physiol. A 187, 
359-370 (2001). 

10. Wetzel, C. H. et al. Functional expression and characterization of a Drosophila 
odorant receptor in a heterologous cell system. Proc. Natl Acad. Sci. USA 98, 
9377-9380 (2001). 

11. Katada, S., Nakagawa, T., Kataoka, H. & Touhara, K. Odorant response assays for 
a heterologously expressed olfactory receptor. Biochem. Biophys. Res. Commun. 
305, 964-969 (2003). 


©2008 Nature Publishing Group 


NATURE| Vol 452|24 April 2008 


Dobritsa, A. A., van der Goes van Naters, W., Warr, C. G., Steinbrecht, R. A. & 
Carlson, J. R. Integrating the molecular and cellular basis of odor coding in the 
Drosophila antenna. Neuron 37, 827-841 (2003). 

Pelz, D., Roeske, T., Syed, Z., de Bruyne, M. & Galizia, C. G. The molecular 
receptive range of an olfactory receptor in vivo (Drosophila melanogaster Or22a). 
J. Neurobiol. 66, 1544-1563 (2006). 

Wicher, D. et al. Differential receptor activation by cockroach adipokinetic 
hormones produces differential effects on ion currents, neuronal activity and 
locomotion. J. Neurophysiol. 95, 2314-2325 (2006). 

Barry, P. H. The relative contributions of cAMP and InsP3 pathways to olfactory 
responses in vertebrate olfactory receptor neurons and the specificity of odorants 
for both pathways. J. Gen. Physiol. 122, 247-250 (2003). 

Ludwig, A. et al. Two pacemaker channels from human heart with profoundly 
different activation kinetics. EMBO J. 18, 2323-2329 (1999). 

Dhallan, R. S., Yau, K. W., Schrader, K. A. & Reed, R. R. Primary structure and 
functional expression of a cyclic nucleotide-activated channel from olfactory 
neurons. Nature 347, 184-187 (1990). 

Craven, K. B. & Zagotta, W. N. CNG and HCN channels: two peas, one pod. Annu. 
Rev. Physiol. 68, 375-401 (2006). 

Hallem, E. A., Ho, M. G. & Carlson, J. R. The molecular basis of odor coding in the 
Drosophila antenna. Cell 117, 965-979 (2004). 


. Kaupp, U. B. & Seifert, R. Cyclic nucleotide-gated ion channels. Physiol. Rev. 82, 


769-824 (2002). 
Nagel, G. et al. Channelrhodopsin-1: a light-gated proton channel in green algae. 
Science 296, 2395-2398 (2002). 


. Nagel, G. et al. Channelrhodopsin-2, a directly light-gated cation-selective 


membrane channel. Proc. Natl Acad. Sci. USA 100, 13940-13945 (2003). 
Doyle, D. A. et al. The structure of the potassium channel: molecular basis of K* 
conduction and selectivity. Science 280, 69-77 (1998). 


. Sato, K. et al. Insect olfactory receptors are heteromeric ligand-gated ion 


channels. Nature doi:10.1038/nature06850 (this issue). 


LETTERS 


25. Wicher, D., Agricola, H. J., Schénherr, R., Heinemann, S. H. & Derst, C. TRPy 
channels are inhibited by cAMP and contribute to pacemaking in neurosecretory 
insect neurons. J. Biol. Chem. 281, 3227-3236 (2006). 

26. Hille, B. lon Channels of Excitable Membranes 441-470 (Sinauer Associates, 
Sunderland, 2001). 

27. Bugaj, V. et al. Functional properties of endogenous receptor- and store- 
operated calcium influx channels in HEK293 cells. J. Biol. Chem. 280, 
16790-16797 (2005). 

28. Messutat, S., Heine, M. & Wicher, D. Calcium-induced calcium release in 
neurosecretory insect neurons: fast and slow responses. Cell Calcium 30, 199-211 
(2001). 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 


Acknowledgements We thank S. Arend, S. Dietel, S. Kaltofen, A. Rof&ner and 

R. Stieber for technical assistance, E. Grosse-Wilde for discussion, E. Neuhaus and 
H. Hatt for providing the Or22a and Or83b DNA, and M. Biel for providing human 
HCN2 DNA. This study was supported by the Max Planck Society (D.W., M.C.S. 
and B.S.H.) and SFB 604 (TP 4, S.H.H.). 


Author Contributions D.W. designed the experiments, performed the whole-cell 
patch-clamp and the calcium-imaging experiments, and analysed the data. R.B. 
and R.S. made the laser scanning recordings and contributed to the 
electrophysiological experiments. R.H. performed the cAMP radioreceptor assay. 
M.C.S. contributed to the bioinformatics and molecular biology effort. D.W. and 
S.H.H. wrote the paper. B.S.H. and M.C.S. contributed to the manuscript. All 
authors discussed the concepts and results, and commented on the manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. Correspondence and requests for materials should be 
addressed to D.W. (dwicher@ice.mpg.de). 


1011 


©2008 Nature Publishing Group 


doi:10.1038/nature06861 


METHODS 

Cell culture and transfection. HEK293 cells were cultured at a density of 
~2 10* per 35-mm dish and transfected with 1 pg pcDNA3-dOr22a-GFP* 
and 1 jig pcDNA3-dOr83b-GFP° using Roti-Fect transfection kit (Roth). In 
some experiments, we transfected cells with pcDNA3-dOr22a-GFP or 
pcDNA3-dOr83b-GFP alone. We also examined the properties of dOr47a, 
which was cloned, using standard molecular biology methods, into a 
pcDNA3.1(—) expression vector. To test for putative G-protein coupling, cells 
were transfected with the respective odorant receptor DNA together with 1.5 1g 
hHCN2/pcDNA3 (provided by M. Biel), with 1 ug hKCNQ4/pcDNA3.1 or with 
1 pg hCNGA2/pCMV6-XL4 (Origene). For increasing the cAMP level by means 
of AKHR stimulation, cells were transfected with 1 ug P. americana AKHR/ 
pcDNA3.1. 

For electrophysiological experiments, we only used cells showing GFP 
fluorescence (when illuminated at 470 nm) as an indicator of putative odorant 
receptor expression. 

In addition, we also generated amino-terminal GFP-fusion proteins of Or22a 
and Or83b. These were constructed according to standard molecular biology 
protocols. In brief, the odorant receptor genes were cloned into a pAcGFP1-C1 
mammalian expression vector (Clontech Laboratories) in frame with the GFP 
coding sequence, with no intervening stop codon, thus fusing the Aequorea 
coerulescens (Ac) GFP molecule to the N terminus of the odorant receptor genes. 
These constructs were then transfected as outlined above. 

Site-directed PCR mutagenesis was used to introduce two mutations into the 

putative pore region of Or83b. Oligonucleotide primers were designed to replace 
the wild-type TVVGYLG motif with TVGYG. The mutation was subsequently 
confirmed by means of sequencing. 
Electrophysiology. Ion currents in HEK293 cells were measured at room tem- 
perature (23 °C) using whole-cell patch-clamp with appropriate compensation 
of series resistance and of capacitive currents. Additional experiments were 
performed in the outside-out configuration. Current measurements and data 
acquisition were performed using an EPC9 patch-clamp amplifier controlled 
by PatchMaster software (both HEKA Elektronik). Patch-clamp pipettes were 
fabricated from borosilicate capillaries. Pipettes for whole-cell recordings had 
resistances of 2-4 MQ; for excised-patch recordings, the pipette resistance was 
up to 15 MQ. 

The standard pipette solution contained (in mM) 140 KCL, 4 NaCl, 2.2 CaCh, 
2 Mg-ATP, 0.05 Na-GTP, 5 EGTA and 10 HEPES (pH 7.3), and the standard 
bath solution contained (in mM) 135 NaCl, 5 KCl, 1 CaCl, 1 MgCl, 10 HEPES 
and 10 glucose (pH 7.4). To differentiate between ionotropic and metabotropic 
pathways of receptor activation (Fig. la—d), the pipette solution contained either 
2mM ATP plus 2mM GTP or 2mM ADP plus 2mM GDP (both for the whole- 
cell and the outside-out configuration). For the preparation of Cl -free solu- 
tions, the chloride salts were substituted by gluconate salts. The pipette Cs* 
solution contained (in mM) 140 CsCl, 1 Mg-ATP, 10 EGTA and 10 HEPES 
(pH7.3), and the Cs* bath solution contained (in mM) 140 CsCl, 15 glucose 
and 15 HEPES (pH 7.4). The Ca’*-free bath solution contained (in mM) 140 
NaCl, 15 glucose and 15 HEPES, and the Na* -free bath solution contained (in 
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mM) 140 N-methyl-p-glucamine (NMDG)-Cl, 10 CaCl, 10 glucose and 15 
HEPES. 

Calcium imaging. Cells were loaded with Fura-2 by incubation in bath solution 
containing 2 1M Fura-2/acetomethylester for 20min. Free intracellular Ca** 
concentration ([Ca”*];) was determined using the fluorescence ratio method. 
Excitation of Fura-2 at 340 nm and 380 nm was performed with a monochro- 
mator (Polychrome II, TILL Photonics) coupled by means of an epifluorescence 
condenser into an Axioskop FS microscope (Carl Zeiss) with a water immersion 
objective (LUMPFL 40xW/IR/0.8; Olympus). Emitted light was separated by a 
400-nm dichroic mirror and filtered with a 420-nm long-pass filter. 

[Ca**]; was calculated according to the equation 


[Ca**]; = Keg(R — Rmin)/(Rmax ~ R) (3) 


Kets Rmin aNd Rmax Were determined using HEK293 cells permeabilized with 
2 uM ionomycin and three solutions with different concentrations of free Ca”* 
(Ca** -free, 5mM Ca** and 500 nM Ca?*; the composition of the 500 nM solu- 
tion was calculated with WEBMAXC v.2.20)”’. The values of Kegs Rmin and Rmax 
were 3.72 UM, 0.38 UM and 5.9 uM, respectively. 

Fluorescence images were acquired using a cooled CCD camera controlled by 
TILLVision 4.0 software (TILL Photonics). The resolution was 640 X 480 pixel 
in a frame of 175 X 130 pm (X40/IR/0.8 objective). Image pairs were obtained 
by excitation for 100 ms at 340 nm and 380 nm; background fluorescence was 
subtracted. 

Radioreceptor assay. Monolayers of HEK283 cells were incubated for 30 min in 
HEPES Ringer’s buffer containing 1 mM isobutylmethylxanthine, and stimu- 
lated with 100 nM ethyl butyrate for 15 min. The reaction was stopped with 96% 
ethanol, and after evaporation, 50 mM Tris/4 mM EDTA (pH7.5) was added. 
The cAMP content of the cellular extracts was measured by a cAMP [°H] assay 
system (GE Healthcare) using the affinity of a purified cAMP-binding protein 
anda charcoal separation step according to the instructions of the manufacturer. 
Proteins were measured in parallel samples, and intracellular cAMP concentra- 
tion was expressed as pmol mg ! cell protein. 

Substance application. With the exception of GDP-B-S, which was applied by 
means of the patch pipette, all substances were applied to the bath using either a 
bath perfusion system (BPS4 from ALA) or a rapid solution changer (RSC160 
from Biologic), which were controlled by the PatchMaster software (HEKA 
Elektronik). 

Data analysis. For data analysis, the software IgorPro (WaveMetrics) or Prism 4 
(Graph Pad Software) were used. 

Chemicals. 8-bromo-cAMP, 8-bromo-cGMP, benzaldehyde, ethyl butyrate, 
forskolin, GDP-B-S, KT5720 and pentyl acetate were obtained from Sigma; 
myristoylated protein kinase A inhibiting peptide 14-22 amide (PKI1422) and 
U73122 were from Calbiochem; and Fura-2 and wheat germ agglutin were from 
Invitrogen. 


29. Patton, C., Thompson, S. & Epel, D. Some precautions in using chelators to buffer 
metals in biological solutions. Cell Calcium 35, 427-431 (2004). 
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Upper intestinal lipids trigger a gut-brain-liver axis 
to regulate glucose production 


Penny Y. T. Wang’, Liora Caspi', Carol K. L. Lam’*, Madhu Chari’, Xiaosong Li*, Peter E. Light°, 


Roger Gutierrez-Juarez*, Michelle Ang’, Gary J. Schwartz* & Tony K. T. Lam 


Energy and glucose homeostasis are regulated by food intake and 
liver glucose production, respectively. The upper intestine has a 
critical role in nutrient digestion and absorption. However, studies 
indicate that upper intestinal lipids inhibit food intake as well in 
rodents and humans by the activation of an intestine-brain axis'~*. 
In parallel, a brain—liver axis has recently been proposed to detect 
blood lipids to inhibit glucose production in rodents’. Thus, we 
tested the hypothesis that upper intestinal lipids activate an 
intestine-brain-—liver neural axis to regulate glucose homeostasis. 
Here we demonstrate that direct administration of lipids into the 
upper intestine increased upper intestinal long-chain fatty acyl- 
coenzyme A (LCFA-CoA) levels and suppressed glucose production. 
Co-infusion of the acyl-CoA synthase inhibitor triacsin C or the 
anaesthetic tetracaine with duodenal lipids abolished the inhibition 
of glucose production, indicating that upper intestinal LCFA-CoAs 
regulate glucose production in the preabsorptive state. Subdia- 
phragmatic vagotomy or gut vagal deafferentation interrupts the 
neural connection between the gut and the brain, and blocks the 
ability of upper intestinal lipids to inhibit glucose production. 
Direct administration of the N-methyl-p-aspartate ion channel 
blocker MK-801 into the fourth ventricle or the nucleus of the 
solitary tract where gut sensory fibres terminate abolished the 
upper-intestinal-lipid-induced inhibition of glucose production. 
Finally, hepatic vagotomy negated the inhibitory effects of upper 
intestinal lipids on glucose production. These findings indicate that 
upper intestinal lipids activate an intestine—brain-—liver neural axis 
to inhibit glucose production, and thereby reveal a previously unap- 
preciated pathway that regulates glucose homeostasis. 
Intraduodenal administration of lipids, in particular long-chain 
fatty acids (LCFA), rapidly inhibits food intake in both rodents and 
humans®’. The small intestinal lipids activate sensory (afferent) neu- 
rons that project directly to the nucleus of the solitary tract (NTS)*. 
Intraduodenal infusion of the topical anaesthetic tetracaine blocks 
neural communication of intestinal vagal sensory fibres and abolishes 
the inhibitory feeding effects of intestinal lipids®. Furthermore, 
intestinal lipids fail to reduce food intake in rodents with gut vagal 
deafferentation’. Together, these findings indicate that upper intest- 
inal lipids negatively regulate food intake in the preabsorptive state 
by activation of the intestine-brain neural axis. On nutrient absorp- 
tion, the brain directly detects a rise in both blood nutrients and 
hormones to negatively regulate food intake and maintain energy 
balance'®*. Recent studies indicate that the hypothalamic sensing 
mechanisms regulate peripheral metabolic processes as well'**°. A 
selective increase of hypothalamic lipids caused by either lipid 
administration’ or inhibition of hypothalamic carnitine palmitoyl- 
transferase-1 (ref. 20) lowers glucose production (GP) and plasma 
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glucose levels. Furthermore, hepatic vagotomy abolishes the ability of 
the hypothalamic lipids to lower GP°. These studies indicate that 
hypothalamic nutrients activate a brain—liver neural axis to maintain 
glucose homeostasis. 

On the basis of these findings, we tested the hypothesis that acute 
accumulation of lipids in the upper intestine triggers an intestine— 
brain—liver neurocircuitry in the preabsorptive state to regulate glu- 
cose homeostasis. Together with findings demonstrating the role of 
intestinal lipids in food intake®’, we propose that intestinal-lipid- 
sensing mechanisms represent one of the first lines of metabolic 
defenses against nutrient excess, providing energy and metabolic 
balance by rapidly downregulating both GP and food intake on 
nutrient exposure. 

To examine whether upper intestinal lipids regulate GP in the 
preabsorptive state (Fig. la), we first established an intraduodenal 
lipid-infused in vivo model. On the day of the in vivo experiment, the 
fully recovered conscious rats were first subjected to a pancreatic 
(basal insulin) clamp to obtain steady state basal measurements of 
glucose metabolism (Fig. 1b and Supplementary Table 1). Intralipid 
was then administered directly into the duodenum at 0.03 kcal min! 
for 50 min (Fig. 1b). Intraduodenal administration of lipids increased 
upper intestinal LCFA-CoA levels (Fig. 1c) but did not alter plasma 
and portal free fatty acids (FFA)/triglycerides (TG) (Supplementary 
Fig. la, b) or liver LCFA-CoA levels (Supplementary Fig. 1c). These 
findings are consistent with previous reports indicating that intra- 
duodenal lipid administered at 0.22 kcal min | (for 30-40 min) did 
not affect plasma lipid levels”’. In the presence of basal circulating lipid 
and insulin levels (Supplementary Fig. 1a, b and Supplementary Table 
1), intraduodenal lipid administration increased the exogenous glu- 
cose infusion rate by 2.5-fold compared to the vehicle to maintain 
euglycemia during the pancreatic clamp (Fig. 1d and Supplementary 
Table 1). The increase in glucose infusion rate was fully accounted for 
by a ~55% inhibition of GP (Fig. le, f), because the rate of glucose 
uptake remained constant (Fig. 1g). Under non-clamped physio- 
logical settings, intraduodenal lipid administration significantly low- 
ered plasma glucose levels by 15 min, and this glucose-lowering effect 
persisted throughout the 50 min protocol (Fig. 1h). Intraduodenal 
lipid administration did not alter peripheral and portal FFA/TG levels 
(Supplementary Fig. 1d) but lowered insulin levels (Fig. 1h) by 15 min 
(when the plasma glucose drop first became evident), indicating an 
enhancement of insulin sensitivity. These data collectively indicate 
that an acute rise in upper intestinal lipids decreased GP and plasma 
glucose levels. 

We next co-infused the inhibitor of acyl-CoA synthetase triacsin 
C with lipids to investigate whether the formation of upper 
intestinal esterified lipids (LCFA-CoA) is required to regulate GP. 


‘Toronto General Hospital Research Institute, University Health Network, Toronto M5G 1L7, Canada. *Department of Physiology, and *Department of Medicine, University of Toronto, 
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Co-administration of triacsin C (80 uM in 0.5 ml) with lipids abo- 
lished the increase in upper intestinal LCFA-CoA levels (Fig. 1c) and 
glucose infusion rate (Fig. 1d) and the reduction in GP (Fig. le, f) 
induced by lipids. Again, there was no difference in glucose uptake 
between groups (Fig. 1g). Administration of triacsin C alone did not 
affect peripheral glucose metabolism (Fig. 1d—g and Supplementary 
Table 2). Thus, we have demonstrated for the first time, to our 
knowledge, that an acute increase of upper intestinal LCFA-CoAs 
suppressed GP. 

We propose that preabsorptive upper intestinal lipids first elicit an 
afferent neuronal signal to the hindbrain. The hindbrain then pro- 
cesses this and relays the signal to the liver to lower GP (Fig. 2a). To 
test this hypothesis, we first infused the topical anaesthetic tetracaine 
(0.5mg in 0.5ml) locally to inhibit the neurotransmission of 
vagal afferent fibres innervating the upper intestine (Fig. 2b). Intra- 
duodenal infusion of tetracaine alone did not affect whole-body glu- 
cose metabolism (Fig. 2c—f and Supplementary Table 2). In contrast, 
tetracaine administration, when co-infused with lipids, significantly 
reduced the ability of duodenal lipids to enhance peripheral insulin 
sensitivity and to lower GP (Fig. 2c—e). These data indicate that 
preabsorptive lipids regulate GP, possibly through a neuronal net- 
work. To assess the potential role of the gut extrinsic sensory innerva- 
tion in this response, we performed intraduodenal lipid infusion in 
rats subjected to either subdiaphragmatic vagatomy or gut vagal 
deafferentation (Fig. 2a, b). Both surgical procedures interrupt 
the vagal afferent innervations from the gut to the brain but when 
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performed alone did not affect peripheral glucose metabolism within 
two weeks (Fig. 2c-f and Supplementary Table 2). In contrast, intra- 
duodenal lipids failed to increase the glucose infusion rate and lower 
GP during the clamp in rats subjected to either subdiaphragmatic 
vagatomy or gut vagal deafferentation (Fig. 2c—e). Glucose uptake 
(Fig. 2f), plasma insulin and glucose levels (Supplementary Table 1) 
were comparable in all groups. Taken together, these independent yet 
complementary findings suggest that the vagal afferent innervation is 
required for the neurotransmission triggered by upper intestinal 
lipids to lower GP. 

The caudal hindbrain NTS senses afferent neuronal signals to 
maintain energy balance’*. Glutamatergic neurotransmission has 
been demonstrated in gut-recipient NTS neurons, and N-methyl- 
D-aspartate (NMDA) ion channels and receptors have been localized 
to vagal afferent terminals in the hindbrain NTS*”*. Importantly, 
direct administration of the NMDA channel blocker MK-801 into the 
fourth ventricle** or the NTS” extends meal duration, probably by 
modulating glutamatergic transmission at central vagal afferent 
terminals. Here we tested the hypothesis that glutamatergic trans- 
mission in the NTS relays afferent neuronal signals from the 
duodenal lipids to the liver to subsequently lower GP (Fig. 3a). We 
implanted a single catheter into the fourth ventricle and performed 
intracerebroventricular (ICV) administration of MK-801 into the 
fourth ventricle (0.42 ug min”! at a rate of 0.083 pl min ') in fully 
recovered rats (Fig. 3b). ICV infusion of MK-801 into the fourth 
ventricle began with the start of the clamp and was maintained 
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Figure 1| Upper intestinal lipids suppress liver glucose production. 

a, Schematic representation of the working hypothesis. Lipid with or without 
triacsin C (Tri-C) was infused through a duodenal catheter. VEH, vehicle. 
b, Experimental procedure and clamp protocol. SRIF, somatostatin. 
Duodenal catheter or venous and arterial (IV) catheters were implanted on 
day 1. On the evening before the pancreatic (basal insulin) clamp studies, all 
rats received a fixed portion of calories (~56 kcal) to ensure a comparable 
post-absorptive state at the start of the clamp experiments. c—e, During the 
pancreatic clamp, intraduodenal lipid infusion increased upper intestinal 
LCFA-CoA levels (¢, *P < 0.05 versus other groups), increased glucose 
infusion rate (d, *P < 0.01 versus other groups) and decreased GP 

(e, *P < 0.001 versus other groups). In contrast, co-infusion with Tri-C 


negated the effects of lipids on upper intestinal LCFA-CoA levels (c), glucose 
infusion rate (d) and GP (e). f, Suppression of GP during the clamp period 
expressed as the percentage decrease from basal GP (*P < 0.001 versus other 
groups). g, Glucose uptake was unaltered in all groups. Administration of 
intraduodenal Tri-C alone (n = 4) and saline (n = 5) were combined as one 
vehicle (VEH) group. VEH (n = 9), Lipid (n = 7), Lipid + Tri-C (n = 6). 

h, Under unclamped settings, intraduodenal lipid infusion lowered plasma 
glucose levels in 15 min (n = 5). This effect persisted throughout the 50 min 
infusion period (*P < 0.01 versus saline (VEH) (n = 5)). Intraduodenal 
lipid infusion for 15 min lowered plasma insulin levels (*P < 0.01 versus 
saline, *P < 0.05 versus pre-intraduodenal infusion). Values are shown as 
mean + s.e.m. 
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throughout the experiment to inhibit NMDA neurotransmission in 
the hindbrain (Fig. 3a, b). Intraduodenal lipid administration was 
initiated onehour after ICV administration of MK-801 into the 
fourth ventricle (Fig. 3b). Administration of ICV MK-801 into the 
fourth ventricle alone did not affect whole-body glucose metabolism 
but attenuated the effects of duodenal lipids to increase the glucose 
infusion rate and to lower GP (Fig. 3c—fand Supplementary Table 2). 
To explore the neuroanatomical location of the blockage effects on 
duodenal lipids in the hindbrain, we implanted bilateral catheters 
into the NTS (Fig. 3a). We infused MK-801 at a 15-fold-lower dose 
than the ICV administration into the fourth ventricle on each cath- 
eter into the NTS (0.03 ug min! at a rate of 0.006 pl min” '). NTS 
MK-801 alone did not affect whole-body glucose metabolism 
(Fig. 3b-f and Supplementary Table 2). However, NTS MK-801 
administration prevented the increase in glucose infusion rate and 
the reduction in GP induced by duodenal lipids (Fig. 3c—e). Glucose 
uptake (Fig. 3f), plasma glucose and insulin levels (Supplementary 
Table 1) were comparable in all groups. These results support the idea 
that NTS glutamatergic transmission relays afferent neuronal signals 
triggered by duodenal lipids to lower GP. 

To begin evaluating whether the hindbrain relays signals induced 
by upper intestinal lipids to the liver to lower GP, we repeated the 
intraduodenal lipid infusion protocol in combination with the 
clamp technique in rats that underwent vagotomy of the hepatic 
branch. Hepatic vagotomy negated the ability of intraduodenal 
lipids to increase glucose infusion rate and lower GP (Fig. 3c—e), 
and when performed alone did not affect whole-body glucose meta- 
bolism within two weeks (Fig. 3c-f and Supplementary Table 2). 
Glucose uptake (Fig. 3f) was comparable in all groups. These data 
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strengthen the working hypothesis that the upper-intestinal-lipid- 
induced neuronal signals are first processed by the NTS and are then 
relayed to the liver to lower GP. 

We next studied whether the activation of the intestine—brain— 
liver neural axis by administration of upper intestinal lipids is 
impaired in the early onset of diet-induced insulin-resistant rodents. 
Rats fed on a lard-oil-enriched diet rapidly increased their caloric 
intake (Supplementary Fig. 2a, b) and acquired brain defect(s) in 
detecting lipids to reduce GP after three days of high caloric intake”. 
We repeated the intraduodenal lipid administration protocol in 
combination with the clamp technique in this early-onset high-fat 
diet (HFD)-induced insulin-resistant model. In comparison with 
rats fed on a regular chow diet, intraduodenal lipid administration 
failed to increase glucose infusion rate and to lower GP in HFD rats 
(Fig. 3g—i). These data suggest that the metabolic effects of duodenal 
lipid are altered, leading to dysregulation of GP in the early onset of 
diet-induced insulin resistance and obesity. 

The body adapts to nutrient excess through multiple redundant 
nutrient-sensing mechanisms in various tissues. The small intestine 
and the brain have both been implicated in sensing nutrients and 
maintaining energy balance by downregulating food intake’*'?™. 
Here we report that an increase in the upper intestinal LCFA-CoA 
level suppressed GP and the plasma glucose level in the preabsorptive 
state in rodents. This was achieved by triggering an intestine—brain— 
liver neuronal network. In parallel, accumulation of LCFA-CoAs in 
the hypothalamus lowers GP and plasma glucose levels”. These data 
suggest that the small intestine and the brain have redundant LCFA- 
CoA-sensing mechanisms to maintain glucose homeostasis by inhi- 
biting GP. The caudal brainstem NTS seems to be a critical site on 
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Figure 2 | Upper intestinal lipids suppress glucose production through a 
neuronal network. a, Schematic representation of the working hypothesis. 
Duodenal lipid was co-infused with tetracaine, which abolishes the 
ascending neuronal signal to the brain. In separate studies, duodenal lipid 
was infused into rats that underwent subdiaphragmatic vagotomy (SVAG), 
gut vagal deafferentation (deafferent) or sham operation. b, Experimental 
procedure and clamp protocol. SVAG, deafferent or sham operation was 
performed about one week prior to the implantation of the duodenal and IV 
catheters. Rats were restricted to ~56 kcal of food one night before the 
pancreatic clamp. ¢, d, Intraduodenal lipid infusion increased the glucose 
infusion rate (c, *P < 0.01 versus other groups) and lowered GP 
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(d, *P < 0.001 versus all other groups) in normal (n = 2) and sham-operated 
(n= 5) rats. Rats receiving duodenal tetracaine administration, SVAG or 
deafferent failed to respond to upper intestinal lipids. e, Suppression of GP 
during the clamp period expressed as the percentage decrease from basal GP 
(*P < 0.001 versus other groups). f, Glucose uptake was unaltered in all 
groups. Intraduodenal tetracaine infusion alone (n = 4), saline-infused rats 
with SVAG (n = 5) and saline-infused rats with deafferent (n = 4) were 
combined as one VEH group. Values are shown as mean + s.e.m. VEH 

(n = 13), Lipid in normal and sham-operated rats (n = 7), Lipid + tetracaine 
(n = 6), Lipid + SVAG (n = 5), and Lipid + deafferent (n = 5). 
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which duodenal-lipid-stimulated vagal afferent signals act on to 
modulate GP. The ability of the NTS to mediate hypothalamic lipid 
signals that regulate GP, however, remains to be elucidated. The NTS 
could serve as a critical site of neural circuit convergence in integrat- 
ing nutrient-sensing signals in the body to negatively regulate GP. 
Additionally, future studies are needed to examine the potential 
cross-talk within brain regions (that is, NTS and the arcuate nucleus) 
in mediating intestinal-lipid-sensing mechanisms that regulate glu- 
cose homeostasis by means of the intestine-brain—liver neural axis. 
We report that the formation of upper intestinal LCFA-CoAs was 
required for upper intestinal lipids to lower GP. Although it still 
remains to be clarified whether enterocytes or enteroendocrine cells 
within the gut are mediating the effects of intestinal LCFA-CoA, our 
findings suggest that local intestinal biochemical events triggered by 
lipid administration are critical in mediating the lipid-induced 
suppression of GP. Intestinal hormones released by the presence of 
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nutrients in the intestinal lumen provide an attractive possible set 
of mediators of this GP-lowering response. For example, upper 
intestinal lipid infusions induce local release of the gastrointestinal 
satiety peptide cholecystokinin (CCK)*’. CCK binds to CCK-A 
receptors on gut vagal afferent neurons and increases neurophysio- 
logical activity in duodenal and gastric fibres that are also sensitive to 
two meal-related stimuli: nutrients and mechanical distension. 
Surgical and chemical interruption of gut vagal afferents attenuates 
the satiety effects of CCK and nutrient secretagogue of CCK, such as 
lipid*’. The potential involvement of CCK in mediating the lipid- 
induced intestine-hindbrain-liver neurocircuitry warrants future 
investigation in both rodents and humans. Furthemore, our data 
indicate that in the early onset of diet-induced insulin resistance, 
the intestine may have acquired defect(s) in lipid sensing, hindering 
glucose homeostasis regulation. These data may begin to shed light 
on the potential pathogenic effects of gut microbiota” and the 
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Figure 3 | Upper intestinal lipids suppress glucose production by activating 
an intestine-NTS-liver neurocircuitry. a, Schematic representation of the 
working hypothesis. NMDA channel inhibitor MK-801 was infused directly 
into the fourth ventricle (ICV 4th) or the NTS. b, Experimental procedure 
and clamp protocol. Stereotaxic surgeries were conducted 7 days before 
placing IV and duodenal catheters. All rats were restricted to ~56 kcal one 
night before the pancreatic-insulin clamp. c, d, Intraduodenal-lipid infusion 
increased the glucose infusion rate (¢, *P < 0.01 versus other groups) and 
lowered GP (d, *P < 0.001 versus all other groups) in rats with ICV infusion 
of saline into the fourth ventricle (n = 4) or into the NTS (n = 3). Rats 
receiving ICV infusion of MK-801 into the fourth ventricle (4th MK), MK- 
801 into the NTS (NTS MK) or HVAG failed to respond to upper intestinal 
lipids to increase glucose infusion rate and lower GP. e, Suppression of GP 
during the clamp period expressed as the percentage decrease from basal GP 


(*P < 0.001 versus other groups). f, Glucose uptake was unaltered in all 
groups. Intraduodenal-saline-infused rats with ICV infusion of saline into 
the fourth ventricle (n = 4), ICV infusion of MK-801 into the fourth 
ventricle (4th MK; n = 4), infusion of saline into the NTS (n = 4) and 
infusion of MK-801 into the NTS (n = 4) were combined as one VEH group. 
VEH (n= 16), Lipid + ICV fourth saline or Lipid + NTS saline (n = 7), 
Lipid + ICV fourth MK-801 (n = 5), Lipid + NTS MK-801 (n= 5). 

Saline + HVAG (n = 5), Lipid + HVAG (n = 5). g—i, During the pancreatic 
(basal insulin) clamp, rats that doubled their daily caloric intake for 

three days (HFD; n = 8) failed to respond to intraduodenal lipid infusion to 
increase glucose infusion rate (g) and lower GP (h, i) compared to rats fed on 
a regular chow (RC; n = 8). i, Glucose uptake was comparable in all groups. 
HED saline (VEH; n = 4), RC saline (VEH; n = 6). Values are shown as 
mean + s.e.m. 
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beneficial metabolic effects of gastric bypass surgeries in obesity and 
diabetes*®. 

The small intestine has developed lipid-sensing mechanisms to 
maintain energy balance by inhibiting food intake in both rodents 
and humans. Here we have provided the first evidence, to our know- 
ledge, that a selective increase in upper intestinal lipids lowers GP 
through an intestine—brain—liver neurocircuitry in rodents. This dis- 
covery may reveal novel sites and targets to lower GP in diabetes and 
obesity. 


METHODS SUMMARY 


Rats were subjected to cannulations in the duodenum, the internal jugular vein 
and the carotid artery for the pancreatic clamp in vivo infusion studies. Rats also 
received subdiaphragmatic vagotomy, gut vagal deafferentation, hepatic vagot- 
omy, or stereotaxic surgery that implanted a single catheter into the fourth 
cerebral ventricle (ICV fourth) or bilateral catheters into the NTS. The in vivo 
infusion experiments lasted a total of 200 min, with pancreatic clamps (insulin at 
0.8mUkg 'min™! and somatostatin at 3p1gkg ‘min ') initiated at 90 min 
and intraduodenal infusions began at 150 min. A different group of rats also 
received ICV fourth or NTS continuous administrations at 90 min. Upper 
intestinal and liver tissues as well as portal plasma samples were taken shortly 
after the clamps to assess lipids level. For the HFD studies, rats underwent 
duodenal/intravenous cannulations and were put on a lard-oil-enriched HFD 
to induce insulin resistance. After 3days of high caloric intake, the rats 
underwent the pancreatic clamp studies with intraduodenal infusions. Two 
additional sets of intraduodenal lipid/saline infusion experiments were per- 
formed for 15 min or 50 min in unclamped settings to evaluate the impact on 
plasma metabolite levels. All study protocols were reviewed and approved by 
the Institutional Animal Care and Use Committee of the University Health 
Network. 
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A deadenylation negative feedback mechanism 
governs meiotic metaphase arrest 


Eulalia Belloc! & Ratl Méndez! 


In vertebrate oocytes, meiotic progression is driven by the sequen- 
tial translational activation of maternal messenger RNAs stored in 
the cytoplasm. This activation is mainly induced by the cytoplas- 
mic elongation of their poly(A) tails, which is mediated by the 
cytoplasmic polyadenylation element (CPE) present in their 3’ 
untranslated regions'’. In Xenopus oocytes, sequential phase- 
specific translation of CPE-regulated mRNAs is required to acti- 
vate the maturation-promoting factor, which in turn mediates 
entry into the two consecutive meiotic metaphases (MI and 
MII)**. Here we report a genome-wide functional screening to 
identify previously unknown mRNAs cytoplasmically polyadeny- 
lated at meiotic phase transitions. A significant fraction of tran- 
scripts containing, in addition to CPEs, (A+ U)-rich element 
(ARE) sequences (characteristic of mRNAs regulated by deadeny- 
lation’) were identified. Among these is the mRNA encoding C3H- 
4, an ARE-binding protein that we find to accumulate in MI and 
the ablation of which induces meiotic arrest. Our results suggest 
that C3H-4 recruits the CCR4 deadenylase complex to ARE- 
containing mRNAs and this, in turn, causes shortening of 
poly(A) tails. We also show that the opposing activities of the 
CPEs and the AREs define the precise activation times of the 
mRNAs encoding the anaphase-promoting complex inhibitors 
Emil and Emi2 during distinct phases of the meiotic cycle. 


Taken together, our results show that an ‘early wave of cytoplas- 
mic polyadenylation activates a negative feedback loop by acti- 
vating the synthesis of C3H-4, which in turn would recruit the 
deadenylase complex to mRNAs containing both CPEs and 
AREs. This negative feedback loop is required to exit from meta- 
phase into interkinesis and for meiotic progression. 

Cytoplasmic polyadenylation requires two elements in the 3’ 
untranslated regions (UTRs) of responding mRNAs: the polyadeny- 
lation signal (Hex, also known as Hhex) and the nearby CPE, which 
recruits the CPE-binding protein (CPEB). However, the activation of 
CPE-containing mRNAs does not occur en masse at any one time (see 
Supplementary Fig. 1). Instead, the polyadenylation of specific 
mRNAs is temporally regulated** by two sequential phosphoryla- 
tions of CPEB. First, phosphorylation of CPEB by aurora A kinase® 
at the prophase I (PI)—MI transition induces a first wave of ‘early’ or 
Cdc2-independent cytoplasmic polyadenylation®, required for the 
PI-MI transition. Then, CPEB phosphorylation by Cdc2 at MI trig- 
gers a second wave of ‘late’ or Cdc2-dependent polyadenylation’, 
required for the MI—MII transition®’. However, a less understood 
third wave of polyadenylation takes place during interkinesis target- 
ing mRNAs (such as those encoding cyclin E'° and Weel, ref. 11) that 
have ‘late’ CPEs plus an additional U-rich sequence that has been 
defined as an embryonic CPE". Although neither the mechanism 
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Figure 1| Polyadenylation-induced translation of C3H-4 mRNA is required 
to exit MI. a, Northern blot of C3H-4 mRNA from untreated or 
progesterone (Prog.)-stimulated oocytes. Where indicated, the RNA was 
incubated with oligo(dT) and RNaseH. b, Oocytes were microinjected with 
C3H-4 antisense oligonucleotides (3’ C3H-4-as or 5' C3H-4-as), C3H-4 
sense oligonucleotides (C3H-4-s) or C3H-4-encoding mRNA (+ His—C3H- 
4mRNA) and were then treated with progesterone as indicated. Expression 


eM Ml pMI 


of C3H-4 was analysed by western blot. c, Oocytes were fixed and stained for 
chromosome visualization 6 h after GVBD. The MII metaphase plate (MII), 
polar body (PB) and the MI pseudometaphase plate (pMI) are indicated. 
Scale bars represent 20 jim. DN, dominant negative C3H-4 variant. d, The 
oocytes were collected at the indicated times after progesterone stimulation 
and analysed for Cdc2 kinase activity. 
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dictating this third wave of ‘late—late’ cytoplasmic polyadenylation 
nor its meiotic function have been defined, it could be required to 
establish the MII arrest. 

To identify the maternal mRNAs controlling meiotic progression 
and to elucidate how their translational regulation is temporally 
defined during the meiotic phase transitions, we designed a genome- 
wide functional screening for mRNAs polyadenylated in the course of 
the PI-MI and the MI-MII transitions (Supplementary Fig. 2a—c and 
Supplementary Tables 1 and 2). In addition to the Hex and the CPEs, 
57% of these UTRs contained class I AREs and 38% of the UTRs 
contained U-rich sequences resembling class III AREs or embryonic 
CPEs’’. Depletion of five of the newly identified mRNAs confirmed 
that their translation was required for meiotic progression (Supple- 
mentary Fig. 2e). Because more than half of the UTRs identified in 
our screening contained AREs, we focused our attention in C3H-4, 
an ARE-binding zinc-finger protein that is a member of the 
Tristetraprolin family of proteins'*’’. We first confirmed that the 
endogenous mRNA encoding C3H-4 was indeed deadenylated in 
PI-arrested oocytes and was subsequently polyadenylated in response 
to progesterone (Fig. la) in a Mos/Cdc2-independent manner 
(Supplementary Fig. 3). C3H-4 protein was synthesized in response 
to progesterone (Fig. 1b and Supplementary Fig. 4). These data indi- 
cate that C3H-4 is encoded by a maternal mRNA activated by ‘early’ 
cytoplasmic polyadenylation during the PI-MI transition. 

We next aimed to determine whether C3H-4 synthesis was 
required for meiotic progression beyond MI. Depletion of C3H-4 
(Fig. 1b) or overexpression of a C3H-4 dominant-negative variant 
induced partial chromosomal decondensation in the absence of a 
polar body (Fig. 1c) and abnormal morphology, consistent with a 
defect in the MI-MII transition (Supplementary Fig. 5). This pheno- 
type was rescued by overexpressing C3H-4 from a microinjected 
mRNA not targeted by the antisense oligonucleotide (Fig. 1b, c). 
To better define at which step of meiosis the C3H-4 antisense micro- 
injected oocytes were arrested, we performed a time course of 
Cdc2 activity (Fig. 1d). In the control oocytes, Cdc2 activity sharply 
decreased after germinal vesicle breakdown (GVBD) and increased 
again at MII, whereas, in the C3H-4-depleted oocytes, Cdc2 activity 
was maintained at high levels during the whole time course. Oocyte 
maturation and Cdc2 activation took place about an hour earlier in 
C3H-4-depleted oocytes than in control oocytes (data not shown). 
Altogether, these results show that C3H-4 depletion accelerates 
maturation and induces meiotic arrest in a pseudo-metaphase 
state’®’” during the first meiotic division. We conclude that C3H-4 
opposes maturation-promoting factor (MPF) activation and is 
required for exit of the first meiotic metaphase. 

We next sought to determine whether C3H-4 could mediate 
deadenylation of ARE-containing mRNAs. Indeed, a polyadenylated 
fragment of the tumour necrosis factor « (TNF-x) 3’ UTR, which 
contains well characterized AREs and binds C3H-4 (ref. 15), was 
rapidly deadenylated on MI entry, concomitant with C3H-4 syn- 
thesis (Fig. 2a and Supplementary Fig. 6). Ablation of C3H-4 
mRNA indicated that TNF-« deadenylation requires C3H-4 
(Fig. 2a). We next aimed to determine whether C3H-4 recruited a 
deadenylase to the ARE-containing mRNAs. For this, we focused in 
the CCR4—Not deadenylation complex, which is known to mediate 
deadenylation of cell-cycle-related ARE-containing mRNAs in verte- 
brates'*”°. Indeed, CCR4 co-immunoprecipitated with C3H-4, and 
vice versa, in extracts from MII oocytes (Fig. 2b). Because C3H-4 
was detected as a doublet in MII (Fig. 2b), we tested whether this 
protein was phosphorylated in the course of meiosis, suggesting also 
a post-translational regulation of its activity, in agreement with the 
fact that we did not detect any clear effect of overexpressing C3H-4 
(data not shown). Indeed, overexpressed tagged C3H-4 displayed 
slower gel mobility in MI than in PI and an intermediate mobility 
in MII (Fig. 2c)—an effect that was reverted by treatment with alkaline 
phosphatase (Supplementary Fig. 7). When C3H-4 is used as substrate 
in an in vitro phosphorylation assay with extracts from oocytes at 
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different meiotic stages as the source of kinase, we detected phosphor- 
ylation of C3H-4 in Pland MI and hyperphosphorylation, shown as a 
slow-migrating band, in MI (Fig. 2d). These observations, together 
with the fact that CCR4 seems to co-immunoprecipitate mainly the 
slow-migrating form of C3H-4, could reflect a post-translational 
activation of C3H-4 in MI and inactivation for MII entry. 

To identify C3H-4-regulated mRNAs, we used a candidate 
approach. Because C3H-4 mediates deadenylation in MI, and therefore 
translational inhibition, we searched for ARE- and CPE-containing 
mRNAs encoding proteins that, when overexpressed in MI, caused a 
similar meiotic arrest. Two of the best candidates were the anaphase- 
promoting complex (APC) inhibitors emil and emi2. Emil is transi- 
ently expressed at low levels during the PI-MI transition, and its 
degradation in MI is required for progression beyond prometa- 
phase-I (refs 20-22). Emi2 is synthesized during interkinesis**** and 
is required for the establishment and maintenance of the CSF”. 
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Figure 2 | C3H-4 mediates ARE-dependent deadenylation in oocytes by 
recruiting the CCR4 deadenylase to the mRNAs. a, Oocytes injected with 
either C3H-4 sense (s) or C3H-4 antisense (as) oligonucleotides were 
microinjected with a radiolabelled polyadenylated TNF-« RNA probe, 
incubated in the absence or presence of progesterone (Prog.), and then total 
RNA was extracted and analysed by gel electrophoresis followed by 
autoradiography. b, Cytoplasmic extracts from mature oocytes, or the 
corresponding fractions immunoprecipitated (IP) with anti-C3H-4 and 
anti-CCR4 antibodies or control immunoglobulin (Ig)Gs, were analysed by 
western blot with either C3H-4 or CCR4 antibodies. ¢, His-C3H-4-mRNA- 
microinjected oocytes not treated with progesterone (—P), or treated and 
collected at the indicated times after maturation, were analysed by western 
blot for C3H-4. d, In vitro kinase assays of oocyte extracts collected at the 
indicated times and using either GST—C3H-4 or histone H1 as substrates. 
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Premature expression of Emi2 at MI causes MI arrest****. The emil 
3' UTR contains five potential ‘early-weak’ CPEs® and multiple poten- 
tial AREs. The emi2 3’ UTR contains four ‘late-strong’ CPEs* and 
multiple potential AREs (Supplementary Fig. 8a). A recent report™* 
shows that the 3’ UTR of emi2 is polyadenylated in a CPE-dependent 
manner. 

To test whether emil and emi2 mRNAs could be regulated by 
cytoplasmic changes in their polyadenylation status during meiosis, 
we compared the polyadenylation kinetics of microinjected labelled 
probes for the 3’ UTRs of emil, emi2, cyclin B1 (which contains 
strong-late CPEs* but no AREs) and C3H-4 (which contains 
strong-early CPEs* and AREs; Supplementary Fig. 8a). The emil 3’ 
UTR was weakly polyadenylated during the PI-MI transition and was 
sharply deadenylated between MI and MII. Cyclin B1 3’ UTR was 
already maximally polyadenylated at MI, whereas emi2, which 
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Figure 3 | The polyadenylation profile is defined 
by the CPE arrangement and the presence of 
AREs. The indicated radiolabelled RNAs were 
microinjected into oocytes, and then incubated in 
the presence or absence (—P) of progesterone. At 
the indicated times after maturation (min), the 
oocytes were collected and total RNA was 
extracted and analysed by gel electrophoresis 
followed by autoradiography. Cdc2 kinase 
activity at each point was determined as in Fig. 1. 
The percentage of polyadenylated RNA is 
indicated. 
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contains a similar arrangement of CPEs, was not maximally poly- 
adenylated until interkinesis. C3H-4 was partially polyadenylated in 
MI and fully polyadenylated in MII (Fig. 3). These data suggest that 
the specific arrangement of CPEs combined with the presence or 
absence of AREs defines specific patterns of poly(A) tail length during 
meiosis. 

To determine whether the emi! and emi2 mRNA-specific patterns 
of polyadenylation were indeed defined by CPEB and C3H-4, we first 
tested whether U-labelled probes from emil and emi2 3' UTRs were 
able to bind these proteins. As controls, we used probes derived from 
TNF-a 3' UTR (which contains AREs but no CPEs) or cyclin Bl 
3’ UTR (Supplementary Fig. 8a). Ultraviolet crosslinking of cytoplas- 
mic extracts from C3H-4-overexpressing oocytes, followed by C3H-4 
immunoprecipitation or CPEB purification, showed that emil, emi2 
and TNF-«, but not cyclin Bl, probes bound C3H-4 (Fig. 4a). In 
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Figure 4 | The polyadenylation of emi7 and emi2, CPE- and ARE-containing 
mRNAs, is regulated by CPEB and C3H-4. a, The indicated radiolabelled 
RNAs were incubated in oocyte extracts overexpressing C3H-4, were 
ultraviolet-crosslinked and were then immunoprecipitated with anti-C3H-4 
antibody or control IgGs. The immunoprecipitates were analysed by 
SDS-PAGE followed by autoradiography. b, The same RNA probes and 
ultraviolet-crosslinked extracts were subjected to sepharose 
chromatography and analysed as in a. ¢, Cytoplasmic extracts from MI 
oocytes were immunoprecipitated with anti-CPEB, anti-C3H-4 and 


—— 


control-IgG antibodies. The immunoprecipitates were analysed for the 
indicated mRNAs by RT-PCR. d, Oocytes microinjected with 3’ C3H-4 
antisense oligonucleotides (C3H-4-as) or C3H-4 sense oligonucleotides 
(C3H-4-s) were treated with progesterone and collected at metaphase I (MI) 
or metaphase II (MID), or were left untreated (— P). Total RNA was extracted 
and the polyadenylation status of the indicated mRNAs measured by RNA- 
ligation-coupled RT-PCR. e, Schematic diagram showing the sequential 
waves of polyadenylation and deadenylation driving meiotic progression. 
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contrast, CPEB was crosslinked to emil, emi2 and cyclin B1, but not 
to TNF-a, probes (Fig. 4b). 

We next sought to determine whether the endogenous mRNAs 
were also associated to C3H-4 and CPEB by immunoprecipitation 
with either C3H-4 or CPEB antibodies followed by polymerase chain 
reaction with reverse transcriptase (RT-PCR) for each mRNA. 
Indeed, the endogenous emi1, emi2 and cyclin B1 mRNAs were assoc- 
iated with CPEB in MI oocytes (Fig. 4c), whereas emi1 and emi2, but 
not cyclin B1, mRNAs were bound to C3H-4 (Fig. 4c). Thus, we 
conclude that emil and emi2 mRNAs recruit CPEB and C3H-4. To 
determine whether poly(A) tail length of emil and emi2 mRNAs was 
controlled by C3H-4, we analysed their polyadenylation in either 
control oocytes or C3H-4-depleted oocytes (Fig. 4d). In control 
oocytes, emil mRNA was deadenylated in PI, polyadenylated in MI 
and deadenylated again in MII, whereas, in C3H-4-depleted oocytes, 
emil was not deadenylated in MII but was further polyadenylated. In 
the same control oocytes, emi2 mRNA was also deadenylated in PI 
but maximum polyadenylation was not detected until MII, whereas, 
in the absence of C3H-4, the mRNA was already polyadenylated in 
MI. Cyclin E mRNA, another component of the CSF"°, which has a 
similar arrangement of CPEs and AREs to emi2, displayed the same 
polyadenylation pattern as that of emi2 mRNA in the presence and 
absence of C3H-4. As expected, the polyadenylation of cyclin B1 
mRNA was not affected by the depletion of C3H-4. Thus, for emil, 
the weak ‘early’ polyadenylation at the PI-MI transition was reverted 
once C3H-4 is synthesized in MI, whereas, for emi2 and cyclin E, the 
strong ‘late’ polyadenylation at MI was displaced to interkinesis by 
C3H-4, probably as the result of the opposing activities of CPEB and 
C3H-4 in MI. Accordingly, Emi2 protein synthesis in oocytes expres- 
sing C3H-4 takes place during interkinesis/MII, whereas, in the 
absence of C3H-4, Emi2 is already expressed at high levels in MI 
(Supplementary Fig. 8b), explaining therefore the MI arrest caused 
by preventing C3H-4 mRNA translation. 

Taken together, our results show that the ‘early’ wave of cytoplas- 
mic polyadenylation activates a negative feedback loop (which 
opposes CPEB activity on mRNAs containing both CPEs and 
AREs) by activating the synthesis of C3H-4. The effect of the C3H- 
4-mediated deadenylation on the target mRNAs is in turn modulated 
by the different arrangements of CPEs. Thus, for an mRNA poly- 
adenylated by the ‘early’ activation of a weak CPE, C3H-4 inactivates 
the mRNA after MI. However, for mRNAs containing a late-strong 
CPE arrangement, C3H-4 causes a delay in the poly(A) tail elonga- 
tion, generating a third wave of polyadenylation in interkinesis. 
These sequential waves of polyadenylation and deadenylation drive 
meiotic progression (Fig. 4e). Thus, ‘early’ polyadenylation of mos, 
cyclins B2 and B5 and emi1 would be required to activate and sustain 
MPF activity in MI, whereas ‘late’ polyadenylation of cyclins B1 and 
B4 would be required to sustain an intermediate MPF activity during 
interkinesis and for the reactivation of MPF in MII. In addition, 
activation of emi2 and cyclin E during interkinesis is required to 
generate CSF. Conversely, ‘early’ polyadenylation-driven translation 
of C3H-4 generates a deadenylation wave that inactivates emi1 trans- 
lation in MI allowing the transition to interkinesis, and delays Emi2 
and cyclin E synthesis to late interkinesis preventing premature 
activation of CSF. Sequential waves of polyadenylation and dead- 
enylation ensure irreversible, self-sustained, meiotic phase transi- 
tions by controlling discrete states of MPF, APC and CSF activities. 
The bistability of this process is sustained by multiple translation- 
dependent positive and negative feedback loops that help to keep the 
oocyte from slipping rapidly back and forth between cell cycle 
phases**°°, 


METHODS SUMMARY 


Oocyte manipulation and microinjection. Stage VI oocytes were obtained 
from the ovaries of adult Xenopus females as described previously’. Matura- 
tion was induced by incubating oocytes in Barth’s medium containing 10 1M 
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progesterone. DNA staining to assess meiotic progression was performed as 
described*’, and the Cdc2 kinase activity was assayed as described®. 

Antisense oligonucleotide and rescue experiment. To ablate the expression of 
C3H-4 mRNA, 18-mer oligonucleotides targeting either the 5’ UTR or the 3’ 
UTR were designed; the complementary sequence of one of them was used as a 
control. In each oocyte, 98 ng oligonucleotides was injected. After overnight 
(16h) incubation at 18 °C, progesterone was added as described. For the rescue 
experiment, the 5’ C3H-4 antisense injected oocytes were used followed by 
injection of 0.03 pmol of in vitro transcribed RNA coding for the ORF and the 
3’ UTR of C3H-4 mRNA. See Methods for further details of the oligonucleotide 
sequences. 

RNA-ligation-coupled RT-PCR. This technique was performed as described** 
with minor modifications. See Methods for detailed modifications. 
Antibodies and immunoprecipitations. The CPEB antibody was a gift from J. D. 
Richter, the dCCR4 antibody was a gift from E. Wahle and the Emi2 antibody 
was a gift from T. U. Mayer. The ultraviolet crosslinking followed by immuno- 
precipitations were performed as described**, using 50 fmol of emil and emi2 
labelled RNAs and 100fmol of TNF-x and cyclin B1 labelled RNAs. 
Immunoprecipitations followed by RT-PCR were performed as described* 
(using fresh MI oocyte lysates, 50 oocytes per condition) with the C3H-4 and 
CPEB antibodies. The protein-bound RNAs were purified by proteinase K diges- 
tion followed by phenol-chloroform extraction. 

Polyadenylation and deadenylation assays. These assays were carried out as 
described in ref. 8. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Purification of poly(A)(+80) RNA with poly(U)-sepharose. A Xenopus 
oligo(dT)-primed complementary DNA oocyte library was in vitro transcribed 
with the T3 mMessage mMachine kit (Ambion), and then the RNAs were treated 
with RNase H and oligo(dT) to remove their poly(A) tail. These RNAs were 
microinjected (46nl per oocyte) into stage VI oocytes that were treated with 
progesterone, and, when mature, total RNA was extracted with Ultraspec 
(Biotecx) and the poly(A)(+80) RNA fraction was purified by poly(U) chro- 
matography. Total RNA was incubated with sample buffer (0.01 M Tris-HCl, 
pH7.5, 1mM EDTA, 1% SDS) for 5 min at 65 °C, the sample was chilled on ice 
and binding buffer was added (0.05 M Tris-HCl, pH 7.5, 0.7M NaCl, 10 mM 
EDTA, 25% (v/v) formamide), and then the sample was loaded into the poly(U)- 
agarose (Sigma) chromatography column and incubated for 30 min at room 
temperature (22—23 °C) with agitation; the column had been previously washed 
with swelling buffer (0.05 M Tris-HCl, pH 7.5, 1 M NaCl) and packed. Next, the 
column containing the sample was washed three times at 25 °C and three times at 
55 °C with washing buffer (0.05 M Tris-HCl, pH 7.5, 0.1 M NaCl, 10 mM EDTA, 
25% formamide) and the remaining poly(A)(+80) RNA was eluted with elution 
buffer (0.05 M HEPES, pH 7, 10 mM EDTA, 90% (v/v) formamide). The eluted 
RNA was precipitated after phenol—-chloroform extraction. Reverse transcrip- 
tion (RevertAid M-MuLV retrotranscriptase, Fermentas) of the purified RNA 
was performed with 3’ Race oligonucleotides (TAATACGACTCACTATAGGG- 
CGGATCCTITTTTTTTTTTITITITTTTITTTVN), and then PCR (Ecotaq 
polymerase, Ecogen) with T7-antisense (GTAATACGACTCACTATAGGGC) 
and T3zorn (GGAAATTAACCCTCACTAAAGGGCAGAATAAACGCTCAAC- 
TTTGGC) primers was performed to amplify the library. To clone the products 
into a pBSK vector, the PCR products were digested with BamHI and EcoRI. 

We sequenced the 253 clones from the early library and 246 of the 2,620 clones 
from the late library. After correcting for redundancy, we found 56 UTRs from 
‘early’ polyadenylated mRNAs and 55 from mRNAs polyadenylated in MI. 
Antisense oligonucleotides for meiotic progression screening. The following 
antisense oligonucleotides were used: hsp90-antisense (GGTGCAGAGAAA- 
GGATAC); pp2c-antisense (CCAATCCCCCCTCCAATG); X71067-antisense 
(AATTTTCGATAGTCCCTC); and tyrP4a2-antisense (GGTGGGGGCAATGC- 
TTAG). 

RNA-ligation-coupled RT-PCR. Total oocyte RNA (4 1g) from pools of six 
oocytes was ligated to 0.4ug SP2 anchor primer (5’-P-GGTCACCTCT- 
GATCTGGAAGCGAC-NH)-3’) in a 10 ul reaction using T4 RNA ligase (New 
England Biolabs) according to the manufacturer’s directions. The whole 10 pl 
RNA ligation reaction was used in a 50 reverse transcription, using the 
RevertAid M-MuLV retrotranscriptase (Fermentas) according to manufac- 
turer’s directions with 0.4 ug ASP2T (GICGCTTCCAGATCAGAGGTGACC- 
TTTTT) as the reverse primer. One microlitre of this cDNA preparation was 
used in each 25 pil PCR reaction using EcoTaq polymerase (Ecogen) and [**P] 
dATP to label the product. The specific primers used were emil-sense 
(GCAAACAGAATTTACGGAGGTTATAG), emi2-sense (GGGGCTTTGGGG- 
GTGTGTATGATCCG), 3’ C3H-4-sense (GCCAGCAGTTGAGAAGCC), cyclin 
Bl (GGAGATCTTGTTGGCACCATGTGCTTC) and cyclin E (GGAGGATG- 
CTACTTGATGGAGAT). The PCR products were resolved in polyacrylamide 
gels and autoexposed. 

Cloning and mutagenesis. C3H-4 (Gene Bank accession number AF061983) 
cDNA was cloned by RT-PCR (using the RevertAid M-MuLV Reverse 
Transcriptase from Fermentas for RT, and Taq plus precision polymerase from 
Stratagene for PCR) with C3H-4-sense (GGGAATTGCGAGATATCAAA- 
TGACAGTCTGG) and C3H-4-antisense (GGGTCGACGTATAAAGTTAGCC- 
GAGGACAG) primers, from total Xenopus oocyte RNA. The PCR product was 
cloned into a pGEX4T-3 expression vector, and into pBSK + pA73 vector for 
in vitro transcription. 

The TNF-a construct was generated from the IMAGE clone 718000 GenBank 
accession number AA261175 by PCR with the TNF-sense (GGAGATCTCAA- 
TGCACAGCTTTCCTCAC) and the TNF-antisense (CCGGATCCTTCCAA- 
CACTGGGTCTTCAG) oligonucleotides. The PCR product was cloned into a 
plasmid after the T3 promoter for transcription and before a poly(A) tail of 73 
nucleotides to obtain a TNF-up(A)73 RNA. 

The emil (AF319594) and emi2 (AF450296) 3’ UTRs were cloned into pBSK 
by performing a PCR from the IMAGE clone 6641423 GenBank accession 
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number BU914851 and IMAGE clone 3405254 GenBank accession number 
BG486873, respectively, with specific primers containing the Smal restriction 
site in the sense primer and the EcoRI restriction site in the antisense primer. The 
plasmid and the PCR products were digested with Smal and EcoRI and were 
ligated. 
Overexpressed C3H-4 Xenopus oocytes extracts. Five-hundred stage VI 
oocytes were injected with 100 ng of in vitro transcribed RNA with the coding 
sequence of C3H-4 and a poly(A) tail of 73 nucleotides. After overnight incuba- 
tion at 18 °C, the oocytes were mixed with other 3,000 non-injected oocytes, and 
then the oocytes were rinsed twice with extract buffer (10 mM HEPES-KOH, 
pH7,5, 100mM KCl, 1mM MgCl,:6H20O, 50 mM sucrose supplemented with 
Complete-EDTA free protease inhibitors from Roche). The oocytes were packed 
at 2,000g for 5 min, the excess of buffer was removed, and the sample was 
centrifuged at 13,562g with maximum acceleration and no-brake deceleration 
in a Beckman Coulter OptiMax Ultracentrifuge, from which the cytoplasmic 
fraction was recovered. 
Antisense oligonucleotide and rescue experiments. To ablate the expression of 
specific mRNAs, we designed 18-mer oligonucleotides targeting either the 
5’ UTR or the 3’ UTR of C3H-4 mRNA: 5’ C3H-4-antisense (5’-CCAGCTTT- 
GACTCCCCAT-3’), 3’ C3H-4-antisense (5'-GGCTTCTCAACTGCTGGC-3’) 
and the complementary sequence as a control 3’ C3H-4-sense (5’-GCCAGC- 
AGTTGAGAAGCC-3’). In each oocyte, 98 ng of oligonucleotides was injected. 
After overnight incubation at 18 °C, progesterone was added as described. For 
the rescue experiment, after overnight incubation of 5’ C3H-4 antisense-injected 
oocytes, 0.03 pmol of in vitro transcribed RNA coding for the ORF and the 
3’ UTR of C3H-4 mRNA were injected into the oocytes, and 1h later progester- 
one was added. 
Antibodies and immunoprecipitations. C3H-4 antibodies were raised in rab- 
bits against the C3H-4 124_137 peptide (CSLPPRRYGGPYRER) and were affin- 
ity-purified with agarose beads (MicroLink Protein Coupling kit, Pierce) 
coupled to the immunogenic peptide, as described by the manufacturer. 
Ultraviolet crosslinking followed by immunoprecipitations were performed as 
described*’, using 50 fmol of emil and emi2 labelled RNAs and 100 fmol of TNF- 
a and cyclin B1 labelled RNAs. For the purification of CPEB in Fig. 4b, we took 
advantage of its ability to bind sepharose in low-stringency conditions; ultra- 
violet-crosslinked extracts were incubated with sepharose beads in NET-N buffer 
(20 mM Tris-HCl, pH 8, 1mM EDTA, 100mM NaCl, 0.5% NP-40) for 2h at 
4°C and then washed with NET-N 175 mM NaCl and eluted with sample buffer 
(200 mM Tris-HCl, pH 6.8, 40% glycerol, 8% SDS, 20mM DTT). Immuno- 
precipitations followed by RT-PCR were performed as described™ with fresh 
MI oocyte lysates (50 oocytes per condition), C3H-4 affinity-purified antibody 
and CPEB antibody. The protein-bound RNAs were purified by proteinase K 
digestion followed by phenol-chloroform extraction. Half of the total RNA 
extracted was used for the retrotranscription, performed with the 3’ Race primer 
(TAATACGACTCACTATAGGGCGGATCCTTTTTTTTTTTTTTTTTTTTTT- 
TTTVN) with the mMLuV reverse transcriptase from Fermentas following the 
manufacturer’s instructions. A twentieth part of the cDNA was used for each 
specific PCR (EcoTaq polimerase, Ecogen) with a common T7-antisense primer 
(GTAATACGACTCACTATAGGGC) and with the following specific primers; 
emil-sense (GCAAACAGAATTTACGGAGGTTATAG), emi2-sense (GGGGC- 
TITGGGGGTGTGTATGATCCG) and cyclin B1 (GGAGATCTTGTTGGC- 
ACCATGTGCTTC). 
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Crystal structure of the 1 repressor and a model for 
pairwise cooperative operator binding 


Steven Stayrook', Peera Jaru-Ampornpan’f, Jenny Ni’, Ann Hochschild” & Mitchell Lewis’ 


Bacteriophage 4 has for many years been a model system for 
understanding mechanisms of gene regulation’. A ‘genetic switch’ 
enables the phage to transition from lysogenic growth to lytic 
development when triggered by specific environmental condi- 
tions. The key component of the switch is the cl repressor, which 
binds to two sets of three operator sites on the 1 chromosome that 
are separated by about 2,400 base pairs (bp)”*. A hallmark of the 1 
system is the pairwise cooperativity of repressor binding*. In the 
absence of detailed structural information, it has been difficult to 
understand fully how repressor molecules establish the coopera- 
tivity complex. Here we present the X-ray crystal structure of the 
intact 2 cl repressor dimer bound to a DNA operator site. The 
structure of the repressor, determined by multiple isomorphous 
replacement methods, reveals an unusual overall architecture that 
allows it to adopt a conformation that appears to facilitate pairwise 
cooperative binding to adjacent operator sites. 

The binding of 1 repressor to operator regions Op and O; on the 
phage chromosome is required during lysogenic growth, when the 
integrated prophage replicates as part of the host chromosome. This 
binding serves two vital functions: it prevents transcription of the 
viral genes necessary for lytic growth and stimulates transcription of 
the cl gene (encoding the repressor). To simultaneously repress and 
activate transcription, a pair of repressor dimers binds cooperatively 
to adjacent operator sites in Op, thereby preventing transcription 
from lytic promoter Px and stimulating transcription from lysogenic 
promoter Pay. Similarly, cooperative binding to adjacent operator 
sites in Oy prevents transcription from lytic promoter P;. Finally, 
long-range interactions between the cooperatively bound repressor 
dimers at Og and O;, directly link the regulatory events occurring at 
these two loci’. 

Although detailed structural information is available for the 
isolated domains of the cl repressor®’, the intact molecule has not 
been structurally characterized. To understand more completely how 
this repressor functions and to better appreciate the structural basis 
of cooperativity, we have determined the structure of an intact 
dimeric repressor bound to a specific 17-bp operator sequence. 
The repressor—operator complex was crystallized using a repressor 
molecule that bears the single amino-acid substitution D197G. This 
mutant repressor forms a functional dimer that binds to single 
operator sites with the same affinity as the wild-type repressor but 
is incapable of participating in the dimer—dimer interactions that 
mediate cooperative DNA binding®. The crystal structure was deter- 
mined by multiple isomorphous replacement methods; although the 
crystals diffract only to a modest resolution (3.9 A), the connectivity 
was readily apparent (Supplementary Fig. 1). The excellent quality 
of the experimentally phased maps allowed us to position the 
amino (N)-terminal domains (NTDs) and carboxy (C)-terminal 
domains (CTDs) with extremely high certainty and to establish the 


connectivity of the polypeptide chain between these two domains. 
Because of the limited resolution of the diffraction data and the 
absence of strong side-chain information, the model of the intact 


Figure 1| Schematic representations of the dimeric repressor bound to the 
operator. One subunit is coloured salmon (CSR, red), the other subunit is blue 
(CSR, dark blue). a, Cartoon representation of the repressor bound to the 
operator. The NTD (N) interacts with the operator DNA, and the CTD (C) self- 
associates and forms the dimer contacts. The two domains are connected by a 
linker that tethers the NTD to the CTD. b, Relative orientation of the NTDs and 
CTDsas observed in the crystal structure, as well as the positions of the CSRs. The 
radii of the spheres were determined based on the actual size of the domains; the 
centre of each sphere corresponds to the domain’s centre of mass. The blue NTD 
and CTD associate through the CSR. In contrast, the salmon NTD and CTD do 
not interact with one another. c, The complex is rotated by 90° and viewed down 
the axis of the DNA. In this orientation it is clear that the CSR stabilizes the 
interactions between the blue NTD and CTD (see also Fig. 2c). d, The complex is 
rotated by 90° with respect to c or 180° with respect to b. This orientation 
provides a view of the CSR for the salmon-coloured subunit and illustrates that 
the NTD and the CTD of this molecule are not in contact. Also shown are the 
C-terminal helices from the NTDs, to demonstrate the connectivity. 
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repressor was refined using tight structural restraints (see 
Supplementary Data). 

The structure of A repressor is conceptually consistent with the 
textbook cartoons that depict it as a dumbbell’ (Fig. 1a). The full- 
length repressor is composed of two structurally distinct domains 
that are tethered by a protease sensitive connector. The NTD (resi- 
dues 1-92), which contains a helix—turn—helix DNA-binding motif, 
is a compact o-helical domain that weakly self-associates to form a 
dimer. Dimers of the NTD recognize and bind to the operator 
sequences using this helix—turn—helix motif®'®. The CTD (residues 
132-236) is a highly twisted B-sheet structure that is responsible for 
establishing the essential dimer contacts and for mediating the 


Figure 2 | Stereo and space-filling views of the dimeric repressor bound to 
an operator fragment. Although the NTDs and the CTDs are related by two- 
fold axes of symmetry, the intact repressor does not have a local dyad. The 
symmetry axes of the individual domains are shown as green rods (a, b). The 
CSRs are coloured dark blue (blue subunit) and red (salmon subunit). a, The 
repressor is oriented such that the principal axis of the DNA is in the plane of 
the page. The two-fold axis of the NTDs is essentially in the plane of the page, 
whereas the dyad axis of the CTDs is perpendicular to the page. The blue 
repressor subunit is more compact and the salmon subunit is elongated. 

b, The complex is rotated 90° and viewed down the major axis of the DNA. 
This orientation illustrates the asymmetry and that the blue CSR bridges the 
NTD and the CTD of this subunit. The two-fold axis of the CTDs also relates 
the red and blue CSRs. ¢, Space-filling representations of the dimeric 
repressor related by a 180° rotation. Views illustrate that the CSR of the blue 
molecule (dark blue) creates a bridge between the NTD and the CTD (right 
image) and that the CSR of the salmon-coloured molecule (red) interacts 
only with the salmon-coloured CTD (left image). 


LETTERS 


higher-order dimer—dimer interactions that underlie cooperative 
binding to the DNA’. In addition, the CTD performs a self-cleavage 
reaction, which is triggered when the lysogenic cell suffers DNA 
damage and depends upon an activated form of the bacterial RecA 
protein'’*. This self-cleavage reaction inactivates the repressor by 


Cooperativity =e 


Cooperativity 
interface 


Figure 3 | ) Repressor binds cooperatively to adjacent operator sites. A 
model of the cooperativity complex was created by superimposing two intact 
dimers onto the structure of the CTD tetramer. a, Looking down on the 
cooperativity complex shows that the CTDs of the repressor dimers associate 
into a tetramer, as determined from previous crystal structures. The arrows 
draw attention to the cooperativity interface. Note that the amino-acid 
residues that mediate cooperativity comprise two discrete patches on the 
CTD surface (that is, four on the surface of each CTD dimer). When two 
dimers associate to form the cooperativity complex, only two of the four 
patches on each dimer become buried at the interface; the two remaining 
patches on each dimer (dark blue and dark pink) remain exposed to solvent, 
offering potential sites for further association (Fig. 4). b, The model of the 
cooperativity complex was rotated approximately 90° (about a horizontal 
axis) from the orientation shown in a. The model illustrates how two 
repressor dimers might bind to adjacent operator sites. The inter-operator 
spacing (centre-to-centre) in this model is about 22 bp, whereas the inter- 
operator spacing for adjacent operators in Op (and for O,1 and O;2) is either 
23 or 24bp. ¢, The cooperativity complex rotated 90° (about a vertical axis) 
from the orientation shown in b and viewed down the major axis of the DNA 
to illustrate that the NTDs bind laterally to the DNA, whereas the CTDs sit 
medially. 
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separating the NTD from the CTD. The connector (residues 93-131), 
which contains the cleavage site, consists of a small protease sensitive 
linker (residues 93-101) and the cleavage site region (CSR). 
Structurally, the CSR is an integral part of the CTD, forming a pair 
of antiparallel B-strands that drapes across its surface (see below). 
Cleavage occurs at a specific site (between Alal11 and Gly112) within 
a long loop (residues 106—126; the cleavage site loop) that connects 
the antiparallel B-strands of the CSR. 

The most remarkable aspect of the intact dimeric structure is its 
lack of global symmetry. Unlike the cartoon representations, the 
structure reveals that the intact repressor does not have a single axis 
of symmetry. The two-fold axes that relate the NID and CTD dimers 
are substantially skewed and are nearly orthogonal (Figs 1b-d and 
2a, b). The interface between the two monomers is extensive, with 
about 2,200 A? of buried surface area per monomer, of which less 
than one-half (about 900 A’) i is due to the CTD—dimer interface. In 
this crystal structure, the NID and CTD of one subunit of the dimer, 
but not the other, are in close contact, creating an interface that also 
buries about 900A” of surface area per monomer (blue subunit, 
Figs 1b-d and 2). For the most part the interactions between the 
NTD and the CTD are localized to the CSR, which provides the ‘glue’ 
that holds the NTD relative to the CTD (Fig. 2). In a space-filling 
representation of the dimer, this subunit of the repressor more 


NATURE| Vol 452|24 April 2008 


closely resembles a cylinder than two discrete spheres (Fig. 2c). 
There are also interactions between the CSR of this subunit and the 
NTD of the dimer-related subunit (Fig. 2c), corresponding to about 
350 A? of buried surface area. As a consequence of this asymmetric 
dimer assembly, the two short linkers (residues 93-101) that connect 
the N-terminal and C-terminal domains have very different confor- 
mations. Although the asymmetric dimer assembly accounts for the 
pattern of cooperative repressor binding to adjacent operator sites 
(see below), we cannot exclude the possibility that crystal packing is 
responsible for this asymmetry. Additional experiments with mutant 
proteins might enable us to eliminate this possibility definitively. 
The active site for the self-cleavage reaction is located in a shallow 
groove on the surface of the A repressor CTD’. It comprises two 
essential residues (Ser149 and Lys192) that are strictly conserved 
among related self-cleaving repressors, including the bacterial repres- 
sor LexA”’. In the case of LexA, a conformational equilibrium was 
shown to govern the rate of the self-cleavage reaction", and struc- 
tures revealed the two conformations: one in which the cleavage loop 
folds over the active site (with the Ala—Gly bond appropriately 
positioned for cleavage) and one in which the scissile bond is about 
20A from the catalytic centre'®. The CSR in the structure of the 
intact 4 repressor dimer adopts a conformation that resembles that 
of the LexA CSR in the non-cleavable conformation (Supplementary 


Figure 4 | Models of the intact repressor octamer reveal the structural 
basis for alternate pairwise cooperativity. a, Interaction of two DNA- 
bound tetramers. The model, which was created by superimposing two 
DNA-bound tetramers (see Fig. 3) on a structure of the ) repressor CTD 
octamer, illustrates why three interacting repressor dimers (shown in dark 
colour) cannot bind to three adjacent operators (Og1, Op2 and OR3, for 
example). Each repressor dimer has two potential cooperativity interfaces’®, 
allowing it to interact with up to two additional dimers (consider, for 
example, the dimer at the bottom left, which is shown interacting with both 
the dimer at the bottom right and the darkly coloured dimer at the top). This 
superimposition of structures shows that only two of the three interacting 
dimers (those at the bottom) can be positioned at adjacent sites on the DNA. 
When the octamer is configured as shown, the third (top dark) and fourth 
(top pale) dimers are appropriately positioned to bind cooperatively to a 
distant pair of operator sites; the model thus shows how a pair of repressor 
dimers bound cooperatively to Og1 and Og? can interact with a second pair 
bound cooperatively to O;1 and O;2, located 2.4 kb away. b, Alternative 
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configuration for the intact repressor octamer. Because of the inherent 
asymmetry of the intact repressor dimer, two DNA-bound repressor dimers 
can be superimposed on the CTD tetramer in two different ways, resulting in 
two different configurations for the intact tetramer (one of which is depicted 
at the bottom, the other at the top). The superimposition depicted at the top 
does not allow for cooperative binding to adjacent operator sites; rather, the 
NTDs fan out in opposite directions. In this model, as well, however, the 
configuration of the three interacting dimers (shown in dark colour) is such 
that only two of the three (those at the bottom) can be positioned at adjacent 
sites on the DNA. c, Schematic of Ox region that illustrates alternate pairwise 
cooperativity. A repressor (CI) dimer bound at Og? can interact with either 
another dimer bound at Og] or another dimer bound at O,3, but not both 
simultaneously. On a wild-type Og region, sites Og1 and Og2 are occupied 
by a cooperatively bound pair of CI dimers (top), whereas when Og] is 
inactivated by mutation, sites Og2 and Ox3 are occupied by a cooperatively 
bound pair of CI dimers (bottom)*. 
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Fig. 2). Although it is likely that the mechanism by which cleavage 
is controlled is similar in these two systems, the higher-order 
interactions between repressor dimers push the equilibrium further 
towards the non-cleavable conformation in the case of  repressor’”. 
This is because the active site residues are buried within the coopera- 
tivity interface when the CTD dimers self-associate to form the coop- 
erativity complex’. 

The unusual quaternary structure of the dimeric repressor pro- 
vides insight into a long-standing problem of repressor function. The 
phage’s Og region consists of three 17-bp operator sites (Og1, Op2 
and O,3), and repressor binds cooperatively to pairs of adjacent 
operators (normally to Og! and Og2, but alternatively to Op2 and 
Ox3 if Og] is inactivated by mutation)’. 

That is, the binding is ‘alternate pairwise’; when the Og region is 
wild type, a dimer bound at Og2 cannot interact simultaneously with 
dimers bound at Og] and O3. Based on the following modelling, we 
suggest that the inherent asymmetry of the DNA-bound repressor 
dimer (Figs 1 and 2) facilitates this pattern of alternate pairwise 
cooperativity, which is essential to the proper functioning of the 
genetic switch. 

We first made a model of the pairwise cooperativity complex, 
which forms when the CTDs of adjacently bound repressor dimers 
self-associate and assemble as a tetramer (dimer of dimers)’. We 
modelled it by superimposing the CTDs of the intact dimeric 
repressor—operator complex onto the structure of the CTD tetra- 
mer’. This superimposition brings the DNA of one repressor— 
operator complex into rough alignment with that of the second 
repressor—operator complex, forming a nearly aligned continuous 
DNA helix (Fig. 3). Although the angular alignment of the operators 
in the model does not precisely match that of the natural operators 
(see Fig. 3 legend), this superimposition of structures provides the 
first plausible model for pairwise cooperativity. In particular, the 
model of the cooperativity complex suggests that when a pair of 
dimers in this conformation binds to adjacent operator sites, the 
CTDsare positioned medially, poised for tetramer formation (Fig. 3). 

To address the issue of alternate pairwise cooperativity, we 
next took advantage of the structure of the CTD octamer'® to model 
the placement of a third repressor—operator complex relative to the 
cooperatively bound dimer pair. That is, we superimposed the CTDs 
of the intact dimeric repressor—operator complex onto the structure 
of the CTD octamer" (Fig. 4). This superimposition can be done in 
two different ways, as depicted in Fig. 4a, b (see legend for details). 
Strikingly, in either case, the third repressor dimer (shown in dark 
colour at the top of each part of Fig. 4a, b) is positioned in such a way 
as to preclude its binding to an operator site located adjacent to the 
pair of operator sites shown at the bottom of each part of Fig. 4a, b. 
Thus, these models make it clear how the geometry of the CTD 
interactions precludes contact between a cooperatively bound dimer 
pair at Og] and Og?2 and an adjacently bound third dimer at O,3. 
Rather, this superimposition of the CTDs of the intact dimeric 
repressor—operator complex onto the structure of the CTD octamer 
depicts the higher-order octameric complex that forms when the 
repressor dimer pair cooperatively bound at Og1 and Og? interacts 
with a similar dimer pair cooperatively bound at O;1 and O,2, 
located 2.4kilobases (kb) away (Fig. 4a)>’’. In conclusion, the 
current and previous structures thus account for both the short- 
and long-range repressor interactions that underlie the regulation 
of lysogeny. 


METHODS SUMMARY 

The A repressor protein with a single amino-acid substitution, D197G, was over- 
produced and purified. The repressor was crystallized with a 20-bp operator 
sequence using the hanging-drop vapour-diffusion method. Crystals grew in 
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space group C222 with the following cell dimensions: a= 74.46 A, 
b= 160.97 A, c= 134.83 A, a= B=y=90° and diffracted to 4A. Data were 
collected at 100K using a 345 MAR research image plate. The structure was 
solved by multiple isomorphous replacement (MIR) methods as implemented 
in SOLVE", and the phases improved by statistical density modification”. 
Previously determined structures of CTD (1F39) and NTD (1LMB) were fitted 
to the density using a genetic algorithm”? and the model was initially refined 
using the rigid-body refinement with the Crystallography & NMR System”. 
Crystallographic and refinement statistics are in Supplementary Table 1. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Codons 1-236 of the 4 repressor with a single amino-acid substitution, D197G, 
were cloned into a derivative of pET 17b (Novagen) to produce a gene that codes 
for a repressor with a non-cleavable His tag at the C terminus. The protein was 
produced in E. coli BL21 and purified using Ni-NTA affinity column (Qiagen). 
The purified ’ repressor D197G was dialysed overnight against buffer that con- 
tained 200mM Tris, 200mM KCl, 50mM EDTA, 3mM DTT (pH 7.4) and 
concentrated to 80 mg ml” '. The sequence of the DNA used in crystallization, 
supplied by the Nucleic Acid Facility, Department of Chemistry, University of 
Pennsylvania, contained 20 bases with a total of 19 complementary base pairs 
with one base overhanging from each strand. Centred on the DNA is the 17-bp 
O;1 sequence, which is the operator that has the highest binding affinity for 1 
repressor: 

5’ TATATCACCGCCAGTGGTAT 3’ 

3’ TATAGTGGCGGTCACCATAA 5’ 

To ensure the integrity of the protein and the DNA used for crystallization, the 
binding of repressor to the operator was confirmed by running a native agarose 
gel (1.4%). The protein-DNA complex was prepared each time before crystal- 
lization. The complex was prepared with the repressor D197G dimer and the 
purified operator in a 1:1.2 molar ratio by diluting the repressor to 10mg ml! 
before mixing. The crystals were obtained by hanging-drop vapour diffusion, 
using 2 ul of the complex solution and 2 il of the well solution. Crystallization 
trays were kept undisturbed in an incubator at 21 °C. The crystallizing condition 
was 14% PEG 4000, 0.1 M sodium acetate (pH 4.6), 0.2 M ammonium sulphate, 
0.05 M magnesium chloride, 20% (v/v) glycerol and 1 mM spermine. Crystals, 
which usually take two to three days to form and about a week to mature, grow 
in space group C222 with the following cell dimensions: a= 74.458 A, 
b= 160.969 A, c= 134.83 A, «= B=y=90° and diffract to 4A both in house 
and at high-energy X-ray sources. Native data were collected at 100 K using a 345 
MAR research image plate mounted on a Rigaku RU-3HR rotating anode gene- 
rator (University of Pennsylvania). Two hundred frames were collected at 1° 
oscillation, 30-min exposure per frame, at 400 mm from the detector. A native 
data set was collected to 3.8 A at 100 K at the National Synchrotron Light Source 
beamline X25 using a charge-coupled device detector. All data were processed 
and scaled using DENZO and SCALEPACK”. 

The structure was solved by MIR. The screening for heavy atoms was greatly 
simplified by using derivatives that were successful for solving the isolated NTD® 
and CTD’. The NTD was phased with K,PtCl, and a multiple anomalous dis- 
placement data set of the CTD was collected from a Hg(OAc) -derivatized 
crystal. To prepare the derivatives, stock solutions of 20 mM of each heavy atom 
were created. One microlitre of heavy-atom stock solution was mixed with 9 pl 
drawn from the mother liquor of a mature crystal to be derivatized to make a 
2 mM solution. An appropriate amount was then added, slowly and carefully bya 
pipette, to the drop containing mature crystals to make the final concentration of 
0.5 or 1 mM in the drop. The coverslip was then returned to the well. 


nature 


Both the mercury and platinum derivative data sets were collected at 100 K to 
4a5A using a 345 MAR research image plate mounted on a Rigaku RU-3HR 
rotating anode generator (University of Pennsylvania). Two hundred frames 
were collected at 1° oscillation, 15-min exposure per frame, at 400 mm from 
the detector. 

The positions of the heavy atoms were determined and preliminary phases 
generated using SOLVE”. The validity of the sites was checked by cross Fourier 
calculations, and the MIR phases were calculated using the CCP4 suite'’; the 
phasing power was 1.33 for mercury and 1.21 for platinum. The initial phases, 
improved by statistical density modification, were used to generate an initial 
electron-density map to 4.5 A. As anticipated, the mercury heavy atoms bound to 
the CTD of the repressor as they did in the original structure. The platinum 
derivative, however, did not occupy the same position in the intact repressor as it 
did in the NTD structure. The CTD and NTD were readily recognizable 
(Supplementary Fig. 1). 

The previously solved structures of CTD (1F39) and NTD (1LMB) were fitted 
to the density using a genetic algorithm we developed specifically for this pur- 
pose*’. The heavy atom sites were then confirmed by calculating a difference 
Fourier transform using phases calculated from the model. The preliminary 
model of the repressor was used to determine the non-crystallographic sym- 
metry of the repressor; the initial phases were further improved by using the 
model as a mask and averaging the density of the NTD and the CTD”. The 
electron density allowed us to trace the polypeptide chain of the CSR using 
the graphics program COOT”. Supplementary Fig. 1 illustrates the fit of the 
cleavage-site region loop to the electron density. 

The structure was initially refined using the rigid body refinement with 
Crystallography & NMR System*!, where the NTDs, CTDs and DNA were 
treated as separate rigid bodies. This reduced the R factor and free R factor 
significantly (Reece 45-39%; R 44-38%). At this point the model for the repressor 
was examined using difference Fourier maps. There were loops in the CTD that 
clearly were displaced; these were corrected. In subsequent rounds of refinement 
at 3.9 A, we used simulated annealing and tight NCS restraints as well as targeted 
restraints on the DNA, which reduced Reece to 35% and R to 29%. New 2F, — F. 
electron-density maps showed clear density for the main-chain atoms but the 
positions of the side-chain atoms were not readily apparent. At this resolution 
the refinement must be heavily restrained as the number of parameters to obser- 
vations is greater than 2; we chose not to push the refinement any further. The 
root mean squared deviation of the backbone atoms of the NTD and CTD dimers 
is less than 0.2 A, and the stereochemistry of the model is consistent with known 
structure. 


22. Otwinowski, Z. & Minor, W. Processing of x-ray diffraction data collected in 
oscillation mode. Methods Enzymol. 276, 307-326 (1997). 

23. Emsley, P. & Cowtan, K. COOT: model-building tools for molecular graphics. Acta 
Crystallogr. D 60, 2126-2132 (2004). 
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hina’s growing research and development (R&D) enterprise is 

something of a paradox. The country is experiencing incredible 

economic growth, largely the result of a manufacturing boom, and 

the government continues to increase R&D funding. But the country’s 
students and young scientists are struggling to cling to their dreams of working 
independently in academia or pursuing their preferred science professions. 
Meanwhile, foreign-trained PhD students are often still favoured over home-grown 
talent (see page 1028). 

Major reforms to China's higher-education system in the late 1990s improved 
its capacity to teach students in a variety of disciplines, and enrolment has risen 
sharply — up from 0.3 million to 13.3 million for undergraduates between 1998 
and 2004. A similar trend can be seen in graduate schools, but graduate studies, 
including those in science and technology, present a stumbling block for career- 
seekers in the form of poor pay. 

Many of China's top students still head overseas, where the United States is 
the most popular destination. A report from the US Council of Graduate Schools, 
released on 14 April, suggests that although graduate-school applications from 
China have slowed somewhat, they are still significant — particularly when 
compared with applications from other nations. 

Other countries also attract plenty of Chinese talent. Around 19% of Canada’s 
science and engineering graduate students were from China in 2004. And of the 
more than 18,000 foreign graduate students enrolled in Japanese universities in 
2005, nearly 10,000 of them were from China. 

Perhaps it is only a matter of time before the Chinese government raises postdoc 
salaries and graduate-student stipends. And maybe there is no rush to stem an 
exodus of students who receive good training and cross-cultural experiences 
abroad — as long, that is, as China finds a way to create jobs to accommodate the 


increasing number who are returning. 
Gene Russo is editor of Naturejobs. 
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CHINA'S 


CHALLENGE 


cientific enterprise is undergoing a much- 

touted expansion in China, fuelled by 

astronomical economic growth and a 

seemingly unlimited supply of talent. But 
graduate students and postdocs in science and 
engineering face numerous hurdles. For many, their 
dreams of becoming an academic researcher seem out 
of reach, owing to problems with funding, training and 
bureaucracy. “If you are really interested in academia, 
it is worth trying. Otherwise, run away,” says Yi Wang, 
a third-year biology graduate student at Peking 
University in Beijing. 

The Chinese government continues to invest heavily 
in science and engineering. In 2006, it increased 
spending on research and development (R&D) to 
1.42% of gross domestic product (GDP). According to 
China’s National Bureau of Statistics, this is a historical 
high. Although the percentage is lower than that of the 
United States or Japan, the total expenditure of around 
300 billion renminbi (US$43 billion), is second only to 
the United States. More than 700 multinational 
companies have built R&D centres in China. 

But limited government support and an excess of 
young researchers have made prospects dim for many. 
The job market has become more crowded as Chinese 
researchers return from overseas. And although the 
government has expanded PhD and postdoc 
programmes in the past few years, there's still a 
shortage of full-time research vacancies. 

Postdoctoral positions were instituted in China in 
1985, largely on the suggestion of Chinese-born Nobel 
physics laureate Tsung Dao Lee. By the beginning of 
2007, China had put a total of 45,000 postdocs through 
the system (see “The rise of the postdoc’). In 2006, the 
latest year for which official statistics are available, 
there were more than twice as many postdocs who were 
in their posts or looking for jobs than in 2002. Some 
40% were in engineering, 20% in science and 10% in 
medicine, according to the 2006 figures. 

Cong Cao, a research fellow at the State University of 
New York, who studies workforce-related issues in 
China, says that roughly 31% of China's science and 
technology postdocs work in basic science and the rest in 
applied science. Although the percentage going to 
industry, government, the military or a second postdoc 
changed little between 2003 and 2006, the percentage 
going to academia dropped, and the percentage of 
postdocs taking other positions increased (see “Where 
they go’). Either unconventional career paths are on the 
upswing, or the market for conventional positions in 
academia and overseas has become saturated — or both. 


Prodigal sons 

The top talent still heads abroad. Only 11% of China’s 
PhDs continue their research in Chinese institutions, 
according to the Ministry of Science and Technology. 
“Postdoctoral positions in China are not as attractive 
as overseas ones in terms of both salary and scientific 
training if the PhDs want to go further in their 
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Young researchers 
in China face stiff 
competition as 
they strive to 
establish labs or 
find other 
scientific careers. 
Wei Zeng explores 
what it takes to 
succeed asa 
Chinese scientist. 
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scientific career,” says Gang Pei, president of the 
Shanghai Institutes for Biological Science (SIBS) and 
president-elect of Tongji University in Shanghai. 

To some extent, the Chinese government encourages 
the flight overseas. Government initiatives such as 
the ‘100 Talents, a Chinese Academy of Sciences 
programme funding 100 scholars per year, typically 
favour those with foreign research experience, says 
Cao. Neuroscientist Yi Rao, who moved back to 
Beijing in 2007 after 24 years in the United States, 
acknowledges this stigma against young scientists who 
have not studied or worked abroad, but suggests that 
things are changing. Rao is dean of Peking University’s 
life-sciences department and deputy director of the 
National Institute of Biological Sciences (NIBS), a 
government-financed institute that opened in 2003. 

More ofa problem, Rao laments, is the lack of an 
intellectual lab culture in many Chinese institutions. One 
of the first things he did upon arrival at Peking University 
was to start a regular seminar series. Most labs in China 
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Science graduate students at Peking 
University are seeking sound career paths. 


are focused on producing papers and results, rather than 
interacting with other labs, Rao says. “Ifinvestigators 
treat science as hard labour, they do not enjoy science,’ he 
says. “And a natural consequence is that their students 
will soon see through their motivation and lose interest” 
Notoriously low stipends exacerbate the difficulty of 
staying in academia. The typical PhD student makes 
between 500 and 1,200 renminbi a month — barely 
enough to survive on in big cities such as Beijing. More 
than 50 biologists and mathematicians recently wrote to 
the government requesting increased support for 
graduate students. The ministries are discussing 
initiatives, but no definite steps have been taken, 
according to Rao, who helped lead the appeal. According 
to China's Xinmin newspaper, Yuliang Yang, head of the 
office of academic degrees committee of the state 
council, told a conference in November 2007 that China's 
PhD students cannot live a decent life normal for people 
in their 20s and 30s. Ata December 2007 workshop on 
graduate education at Peking University, administrators 
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at top Chinese universities expressed similar concerns. 
Taking a second job is rarely an option for dedicated 
graduate students, notes Cao, as they are expected to 
publish two or three papers within three years. Young 
researchers do not fare much better once they get their 
degrees: typical salaries for entry-level PhDs and 
postdocs are 2,000-5,000 renminbi a month, with a rare 
few making thousands more at high-tech companies. 


Supply side 

The government controls the supply of postdoc 
positions, allotting a certain number to each institution 
and often limiting their duration to two to three 

years — a small time window to finish a complex 
research project intended to propel the researcher into 
an independent career. This isn’t a problem everywhere, 
notes Rao — NIBS for example has no time limit. 
Postdoc numbers will continue to increase — the 
Chinese Academy of Sciences runs more than 100 
postdoctoral programmes, which accommodate 20% 
of all Chinese postdocs. Its total budget for postdoc 
support has increased from 1 million renminbi in 1985 
to 3 million today and it plans to support some 4,000 
postdocs by 2010, three times its current number. 

But many still struggle. According to Pei, principal 
investigators at SIBS spend their own grant money to 
support 70-80 postdocs a year and raise the average 
salary. They are also trying to extend the duration ofa 
postdoc in an effort to make their PhDs and postdocs 
more competitive in the job market. 

Weiwei Liu, one of Raos graduate students, plans to 
apply for overseas postdoc positions, but worries about 
her job prospects. “As far as I know, the opportunities 
are limited and the competition is great,’ she says. She 
sees limited opportunities in China in academia and an 
underdeveloped biomedical industry. 

Still, companies could be part of the solution. A few do 
fund university postdocs to address specific research 
problems. Others earn certification to start their own 
postdoc positions. Although this leaves companies 
setting the agenda, Yi Bing Duan, an assistant professor 
at the Institute of Policy and Management at the Chinese 
Academy of Sciences, calls it a potential “triple win’ — 
universities get money, postdocs get support and 
companies get their research questions addressed. Duan 
would like to see postdocs get their own organization, 
something like the National Postdoctoral Association in 
the United States, which would identify postdoc gripes 
and work with the government to find solutions. 

Despite the frustrations, researchers with dreams of 
pursing basic science do persevere. Meng Gao, a PhD 
student in bioinformatics at Peking University, is 
cautiously optimistic. She pictures herself setting up her 
own lab and spending 20 years on one project. “The 
employment situation is especially tough for women 
scientists,” she says. “I know many of them are facing 
difficulties and are planning to give up, or even have left.” 

But there are success stories. Jiang Zhang was recently 
appointed an assistant mathematics professor at 
Renmin University in Beijing after a two-year postdoc 
at the Chinese Academy of Sciences. He enjoys the 
freedom of academia. “I decide when to do what project 
and how to doit,” he says. “The money is not good but is 
acceptable. And I believe not only the money but the 
overall situation in academia will get better. Even if one 
day I go to industry, I still want to do research.” a 
Wei Zeng is a freelance writer for Nature China. 


1029 


LIPING WEI/PEKING UNIV. 


CAREER VIEW 


NATURE|Vol 452|24 April 2008 


MOVERS 


Marja Makarow, chief executive, European 
Science Foundation, Strasbourg, France 


8 


2003-07 Vice-rector, 
University of Helsinki 
2003- (on leave of absence 
until 2012) Professor of 
applied biochemistry and 
molecular biology, University 
of Helsinki 

2000-04 Research director, 
Program in Cellular 
Biotechnology, 

University of Helsinki 


Marja Makarow's early interest in linguistics soon gave way 
to an interest in the sciences after she fell in love with the 
word ‘biochemistry’. But her ability to speak several 
languages and understand multiple fields will serve the 
Finnish biochemist well in her new role as chief executive of 
the European Science Foundation (ESF). 

Makarow is charged with finding new ways to encourage 
cooperation across borders (both political and scientific) to 
support pan-European and interdisciplinary programmes. 
The ESF is not a funding agency, although its 77 member 
bodies include major funding agencies and research 
institutions from 30 countries. Rather, it facilitates 
networking, runs conferences and publishes position papers. 

The University of Copenhagen’s Liselotte Hajgaard, who 
was on the search committee that selected Makarow, cites 
her strong academic background, international experience 
and leadership roles. She recalls seeing Makarow present 
strategies in a new white paper from the ESF's European 
Medical Research Councils. “She was very good at facilitating 
it, in a friendly, kind, polite and still firm way," says Hojgaard. 
"She doesn't come across as a bulldozer. She listens.” 

Makarow studied biochemistry at the University of 
Helsinki, where she did postdoctoral work in virology, and 
went on to the European Molecular Biology Laboratory 
(EMBL) in Heidelberg. She was a group leader at the 
University of Helsinki’s Institute of Biotechnology before 
becoming the institute's research director, an ascent 
interrupted only by a stint at the University of Kuopio. 

Ten years ago, Makarow ventured into policy, serving on 
the Academy of Finland's Research Council for Health, the 
main funding agency for biomedical research. Next, she 
served on the governing body of EMBL and its multinational 
funding organization, a window on the European grant 
funding system. As vice-rector at the University of Helsinki, 
she oversaw research in many scientific disciplines. “When 
you give the little finger, the hand often goes,” says 
Makarow, recounting the many positions she has held in 
addition to running her own lab. 

Hgjgaard uses the word sisu to describe Makarow. “It 
means the ability to go through stone," she says, noting that 
the word often refers to a personality trait that Finns have 
developed during centuries of surviving cold, dark winters 
and holding Russian incursions at bay. “It's a tough job to be 
the chief executive of the ESF, but she can do it. She really 
has sisu." | 
Jill U. Adams 
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MENTORS & PROTEGES 


A guiding COMPASS 


As | contemplated pursuing a PhD 
in cell biology, | looked back at my 
own undergraduate experience and 
decided that scientists in minority 
groups needed a more supportive 
pathway. With the help of associate 
deans and advisers at Carnegie 
Mellon University, | set up COMPASS 
(Coaching Minority Progress and 
Academic Success in Science), a 
mentoring programme led by older 
students that helps incoming minority 
students. That programme now has 
15 members. Its genesis was rooted 
in my own struggles adjusting to a 
rigorous academic atmosphere. 

During my first year as an 
undergraduate, | often felt 
overwhelmed and unsure about my 
abilities as a student. | was lucky to 
meet Malika Jeffries-EL, an African- 
American postdoc in chemistry, whose 
encouragement convinced me that 
my hard work would pay off. Some 
struggling, she said, was natural. She 
and | had much in common. She too 
had been the only black student ina 
science class. She knew the pressure 
to avoid failure. Communicating with 
a woman of colour who had earned an 
advanced science degree helped me 
see a path to success. 

My work improved in my second 
year. My research project, analysing the 
interactions of proteins involved in 


ribosome assembly, was an 
eye-opening experience. | enjoyed 

the various biochemical and genetic 
techniques used to study cellular 
mechanisms; | was stimulated by the 
questions that arose from analysing 
data; and | became interested in solving 
biological problems and addressing 
deeper questions at the PhD level. This 
year | was fortunate to be awarded a 
Howard Hughes Medical Institute 
Gilliam Fellowship for Advanced Study, 
which will fund the entirety of my 
graduate-school experience at the 
school of my choosing. | hope one day 
to become a cell-biology professor. 

My experience of mentoring 
inspired me to start COMPASS, which 
enables incoming minority students 
to network with current students, key 
advisers and faculty members. 
Students are contacted the summer 
before their first year and invited to 
participate. The programme helps 
me share my insights in a supportive 
network. 

This autumn, | plan to study 
regulation of the cell cycle at Stanford 
University. | will continue to serve as a 
mentor and adviser to COMPASS, 
hoping to help young minority 
scientists develop their own paths 
to success. | 
Betty Mbom is a senior at Carnegie 
Mellon University. 


POSTDOC JOURNAL 


In the name of science 


This month | ran more than 20 metres carrying fresh gelada faeces on a rock 
(trying to photograph the defecator) and learned to tell females apart by the 
length of their nipples. It's all in the name of science. | love my job! | am living in 
amonkey soap opera — gelada baboons quarrel and have sex daily, providing 
us scientific voyeurs with hours of data-gathering. 

All| miss is free time. I'd forgotten how much time is consumed by the 
process of gathering and digitizing behavioural data. Still, I'm an absolute nerd. 
| find it exhilarating to see the information increase, and to transcribe gelada 
vocalizations painstakingly recorded through the hazards of windy weather 
and tourist encounters. New ideas are already brewing in my mind, new things 
on which to focus, new questions to ask. (Do rank, spacing and/or hormones 
affect their contact calling?) My brain works in spite of the oxygen deficit! 

Perhaps | am sacrificing some ordinary social skills, though. | am learning to 
identify some 100 geladas by their unique individual quirks or characteristics. 
The downside? | seem to be classifying new people as if they were geladas. 
Just the other day | caught myself comparing one man’s straggly whiskers with 
another's skewed moustache and thinking, ‘These males are easy to tell apart!’ 
| foresee problems readjusting to the human soap opera when | leave the field. m@ 
Aliza le Roux is a postdoctoral fellow in animal behaviour at the 
University of Michigan. 
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SETI for profit 


A capital idea. 


Gregory Benford 


Multi-billionaire Clyde Kraft's announce- 
ment that his team had detected a SETI 
signal shocked the world. He claimed to 
have found a plainly artificial broadband 
signal from a star well over a thousand 
light years away. 

His astronomers ranked 
behind him confirmed the 
discovery. They gave pulse 
spectra, but without labels 
that would let anyone find 
or decipher it. They omitted 
the celestial location or fre- 
quency. Then Kraft stepped 
back to the microphone and 
gave his terms. 

“The real news is that 
we've found a message in 
the megahertz-wide signal. 
It’s intermittently on, hard _ 
to find. But we held steady, _ 
studied the galactic plane for 
over a year.” Kraft paused for 
effect, giving the world a thin 
smile. “Then — there it was! 
In less than a week my fine 
team deciphered it. Not hard, 
really. Its amazing. And ... it’s 
a warning.” 

Consternation. Shouts 
from the press, who thought 
they were attending a busi- 
ness briefing. Kraft wouldn't reveal the 
message or how to find the source. He 
scowled. “I did this as an ... investment. 
Pay me a finder’s fee, say half a billion 
dollars, and we'll talk” With another smile 
he left the dais and vanished from public 
view. 

Around the world scientists rebuked 
him. “To make such a discovery and keep 
it secret violates the code of science,” 
the head of the National Academy pro- 
nounced. “And to expect profit!” 

Parliaments voted to condemn Kraft. 
“The most important discovery in history 
cannot be private property,” declared the 
Pope. 

Most news shows carried the story as a 
scandal, focusing on Kraft’s known cut- 
throat ways of gaining advantage. He had 
used head-to-head methods in stochastic 
arbitrage trading, making him Forbes 2013 
‘King Competitor. Even the Wall Street 
Journal went tsk-tsk. The Times in London 
had a headline three inches high: “What is 
the warning?” 

Kraft wouldnt say. Appeals by politi- 


cians, wise men and philosophers got no 
answer at all. 

Worldwide, astronomers condemned 
Kraft — and got to work. They pored over 
older radio-telescope data, looking for 
sources 1 megahertz wide that came and 
went. There were several and they earned 
front-page coverage. 


SN 


QW 


Dishes trained on these candidates 
listened constantly, to the limits of their 
sensitivity. Kraft would not reveal the dis- 
covery frequency so they scanned through 
their entire range. This was a marked 
change from conventional SETI, which 
had gathered mostly at the ‘water hole’ 
between 1 and 2 gigahertz. 

Observers with telescope time donated 
it to the search for the mysterious source. 
Amateurs erected their own 3-metre radio 
dishes in backyards and got in on the hunt 
with gear they bought over the Internet. 

Kraft’s in-house astronomers were 
sound scientists who had lost out in the 
academic horse race. They refused even to 
discuss the discovery. SETI had been losing 
momentum since 2010, the 50th anniver- 
sary of its first search. Kraft had sought to 
reinvigorate the field, they said, and did so 
with what was to him pocket change, only 
$100 million. 

Scientific professional societies turned 
on these astronomers. Furious executives 
in the AAAS, the IAU and other bodies 
revoked the astronomers’ memberships. 
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Mobs rioted outside their homes. ‘Free 
ET? shouted a popular sign the crowds 
carried in capitals around the world. 

Kraft stayed in hiding. Pressure built. 
The Kraft astronomers also vanished, har- 
ried by incessant demands. 

Months passed. The older observed 
signals yielded nothing. The United 
Nations debated whether it 
could or should confiscate 
Kraft’s holdings within its 
reach — only to find that he 
had moved them into secure 
accounts. Nations began to 
negotiate how to share out 
' the cost of meeting Kraft’s 
price. 

Kraft dodged negotiations. 
Rumours spread that he had 
become a greybearded her- 
mit, like Howard Hughes. 
Reports located him in Swit- 
zerland and North Korea. 

Then the Australian Parkes 
radio telescope picked up 
a quick, millisecond signal 
not far from the line of sight 
towards the galactic centre. It 
was 3 megahertz wide, not 1, 
but it did have a signal. 

Parkes released the pattern 
immediately. Thousands of 
scientists leapt to decode the 
message. Within days they 
had a coherent form, mostly details about 
a solar system similar to ours. It identified 
a race of elephant-like creatures complete 
with trunks, legs and two grasping hands 
at the shoulders. Their society was very 
old and might be extinct. There were hints 
that robots tended the message-sending 
beacon. 

But there was no warning in the mes- 
sage at all. 

With 30 minutes’ notice, Kraft appeared 
at a press conference in Singapore. He 
admitted that there was no warning, the 
bandwidth was different — and in fact he 
had never detected a signal at all. 

“I wanted to goose the world,’ he said. 
“Get ‘em started. SETI needed that a lot 
more than it needed more money. Now 
we have something and it’s time to get to 
work. I'll stand aside” 

He left the room in an uproar. But that 
was nothing when, three years later, he won 
a Nobel, along with Parkes Observatory. m 
Gregory Benford is professor of physics 
and astronomy at the University of 
California, Irvine, and a novelist. 
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